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Symposium: Current Aspects of the Problem of Validity’ 
Suggestions For the Delineation of Validational Studies 


EpwIN S. SHNEIDMAN 
V. A. Center, Los Angeles 


This symposium on “Current As- 
pects of the Problem of Validity” is 
sponsored both by the Division of Clin- 
ical Psychology and the Society for 
Projective Techniques. Our topic re- 
lates to one of the most timely issues 
that might have been selected by 
either group. It is one that is impor- 
tant to our practice and fundamental 
to our science. As chairman, my own 
contribution to this symposium is one 
that might be called “A Fugue and 
Variations on a Four-Part Invention 
by Cron-bach,” and is essentially a 
suggested modification of the types 
of validity formulated by Cronbach’s 
APA Committee on Test Standards 
in 1952. 

One hardly needs to reiterate that 
the APA Committee on Test Stan- 
dards proposed four types of validity, 
as follows: 

1. “Predictive validity, which de- 
notes correlation between test meas- 
ures and subsequent criterion 
measures. 

2. “Status validity, which denotes 
correlation between the test and con- 
current external criteria. 

3. “Content validity, which refers 
to the case in which the specific type 
of behavior called for by the test is 
the goal of teaching or some similar 
activity, as in an academic achieve- 
ment test. 

4. “Congruent validity which is es- 
tablished when the investigator dem- 
onstrates what psychological attribute 
a test measures by showing correspon- 
dence between scores on a test and 
other indications of the state or attri- 
bute. This type of validity is used for 
tests intended to measure a construct 
*Symposium presented at the American Psy- 


chological Association Convention, Wash- 
ington, D. C., August 30, 1958. 


arising from some theory; the valida- 
tion consists of evidence that the 
scores vary (from person to person or 
occasion to occasion) as the theory 
would imply. Essentially, in congruent 
validity the meaning assigned to test 
scores is substantiated by demonstrat- 
ing that scores are consistent with de- 
ductions from the theory from which 
the meaning derived. This validation 
process is much the same as that in- 
volved in evaluating a theory itself.” 

My own suggested modification is 
essentially a spelling out of the sev- 
eral dimensions involved in any vali- 
dational study. I think that I might 
go so far as to propose that each of 
these be indicated at the beginning of 
any validational study. The headings 
I suggest are as follows: 

1. Validity for whom, in which (a) 
the subjects, (b) the examiners, and 
(c) the judges would be described. 
It makes a lot of difference, for ex- 
ample, whether the 700 Rorschachs 
being reported in a normative match- 
ing study were given to Columbia 
sophomores or Columbian peasants; 
whether the examiner was an ABEPP 
grandfather or his comely young 

randdaughter, and whether the 
judges were tyros or Tyroleans. This 
notion is, of course, a sort of tautology 
in that it begs the question of what 
are the relevant dimensions in terms 
of which the subjects, examiners, and 
judges need to be described. 

2. Validity for when, in which one 
(or more) of the three logical tem- 
poral positions would be indicated: 
past, present, and future. Correla- 
tions with future criteria are, of course, . 
called prediction, and with past cri- 
teria are called postdiction,. For corre- 
lations with more-or-less present cri- 
teria (such as “present status”) I 
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have suggested the term paridiction, 
from the word “equal”. 

3. Validity for which. Under this 
rubric one should indicate whether 
correlations are with essentially fac- 
tual variables or with essentially con- 
ceptual variables. This category again 
begs the question because it is pre- 
cisely in this area that most of our 
problems concerning validity reside. 
There are, in clinical psychology, rela- 
tively few “facts”, if we define a fact 
as a perception about which several 
intelligent lay people would instan- 
taneously concur. When we view a 
body lying in a pool of blood and 
see a holographic document with sui- 
cidal content, it is a “fact” (Alfred 
Hitchcock aside) that the individual 
has committed suicide; but in a not- 
yet-dead individual “being suicidal” 
is almost entirely conceptual. Most of 
what we deal with in clinical psy- 
chology are not facts but hypothesis- 
tinged data. 

4. Validity for what. Under this 
heading it is suggested that (a) the 
nature of the stimuli be delineated, 
and more, that (b) the nature of the 
continua or dimensions along which 
the stimuli are believed to be spread 
be delineated. Dr. Farberow will in 
his paper expand on this notion and 
indicate some of the theoretical and 
practical implications of developing 
specific eae testing tech- 
niques which focus on the elabora- 
tion of stimuli along a single psycho- 
logical dimension. 

5. How Validity is established. This 
would be an indication of the basic 
statistical procedures involved. What 
I have in mind is the notion that it 
is not only what the statistics choose 
to tell, but is also in part which statis- 
tical procedures were chosen. Clini- 
cally this means that even the statis- 
tician must be aware of his own trends 
toward latent homoscedasticity. 

But it is to the notion of congruent 
validity of tests, and the idea that the 
validational process is much the same 
as that involved in evaluating a theory 
itself, that I wish to return for the re- 
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mainder of my introductory remarks, 
and make the to-me-obvious, but | 
fear in some circles somewhat icono- 
clastic comment that our psychologi- 
cal tests or projective techniques are 
not validatable until the personality 
theories that give them their intellec- 
tual permissiveness themselves gain 
greater construct validity. 

And that is not all. The matter is 
further complicated inasmuch as the 
data on the other side of the equa- 
tion, the so-called criterion measures, 
are themselves extremely complex and 
little understood. Let me give two ex- 
amples, using homicide and _ schizo- 
phrenia—I had promised that I would 
not dwell upon either suicide or the 
MAPS test during my presentation. 
On the first thought it would seem im- 
pressive if one were to have not ten, 
but 200 Rorschach records of individ- 
uals who had committed homicide 
and records of 200 individuals who 
had not. But a moment’s reflection 
will lead one to the common-sense 
consideration that individuals kill 
other individuals in a variety of ways 
(in a liquor-store hold-up, in the 
lover’s bedroom, on the highway, on 
the field of battle, etc.) —and that each 
of these occurs with a variety of mo- 
tives and mixed feelings. Putting. all 
the “homicidal” records together 
could only yield a kind of psycho 
metric goulash. (Incidentally, suicide 
is even more complicated.) And, now 
let us, for a few moments, consider 
schizophrenia—at first blush a perfect- 
ly wonderful external criterion for 
status validity. Whose schizophrenia 
do we mean? How shall we define 
schizophrenia? j 

1. Is schizophrenia as Kraepelin be- 
lieved, a mental disease process re- 
sulting from an auto-intoxication fol- 
lowing a disorder of metabolism pro- 
duced by a disordered secretion of the 
sex glands? 

2. Or is it, from Arieti, a primitive 
form of rationality, based on patho- 
logical reasoning in terms of attributes 
of the predicate? 

3. Or, from Alexander, is it a state 
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in which the ego loses its synthetic 
function of harmonizing the different 
instinctual demands? 

4. Or, from Jung, does the peculiar- 
ity of this disease lie in the startling 
emergence of an archaic psychology 
accounting for the innumerable points 
of contact with mythological material, 
so that what we take for original and 
individual creations are mostly prod- 
ucts which can only be compared with 
those of antiquity? 

Incidentally, Jung states that there 
are two types of schizophrenia (as- 
thenic and spastic), Beck says there 
are six, and I had been taught all 
along that there are really four. 

5. Or, is schizophrenia, as Meyer 
felt, to be thought of as a habit dis- 
organization resulting from a progres- 
sive maladaption with increasing use 
of substitute reaction, instead of effec- 
tive ones. 

6. Or, do we really understand 
schizophrenic delusions as a projection 
of repressed passive homosexual de- 
sires? I think the formula is as fol- 
lows: I love him, I do not love him, 
I hate him, No he hates me. Now I 
have it, I hate him because he hates 
me. We hate each other, I don’t think 
that I will invite him to my next gay 
party. 

All these are concepts. But there 
are, happily, three solid facts about 
schizophrenia. The only three known 
facts about schizophrenia are: 

1. We do not know exactly how to 
cure it; its remediation is unknown. 

2. We do not know exactly what 
causes it; its etiology is unknown—ex- 
cept when we say that it is crypto- 
genic. 

3. We do not know its nature. We 
do not know what it is, if it is an it, 
which it probably ain’t. 

In 1958, we in clinical psychology: 
(and individuals no less in psychiatry) 
have to deal primarily with the metho- 
logical problems attendant to con- 
struct validity. 

Our task is in some way similar to 
measuring a floating cloud with a rub- 
ber band—in a shifting wind. I used 
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to hear that projective techniques 
were like taking a plug out of a water- 
melon. This is a delusion of agronomy. 

The science of clinical psychology 
relates primarily to hypotheses, not to 
facts. Whether one believes in Freud’s, 
Adler’s, Jung’s, Murray’s, Allport’s, 
Kelly’s, or any other individual’s per- 
sonality theory, one still has to sub- 
sume his cerebrations under the pur- 
view of an essentially untested hypoth- 
esis, nothing more. As a clinician, an 
individual may (in terms of loyalty) 
be from Vienna, or Topeka, or Balti- 
more; but as a scientist he must be 
from Missouri. 

Nor do I believe that the physical 
sciences should be our models. My 
notion is that we do not do well to 
attempt to emulate physicists (simply 
because they have more observable 
facts interlaced among their hypothe- 
ses) but rather.I suggest that we look 
within our own field, that we re- 
examine (with a view toward escap- 
ing the tactical and strategic errors of 
theory-building) the intellectual man- 
euvers of Wundt, Titchener, Dewey, 
Watson, MacDougall, etc., not to go 
back to them, but better to see that 
they were only trying systematically to 
explore the implications of their hy- 
potheses — all their “facts” followed 
after. Our task might well be to plan 
our scientific maneuvers so that even if 
we adhere more or less rigidly to our 
current hypotheses, at least learning 
from our own past, we might avoid 
making some of the methodological 
errors which beset these earlier move- 
ments. 

It appears that the kinds of validity 
discussed by the APA committee and 
by others are all kinds of “research” 
or scientific validity, whereas it may 
be that we may have to deal for a 
while with faith validity or clinical 
validity wherein we are not concerned 
with the veridical value of a hypoth- 
esis but rather with its professional 
sense-appeal within the scope of our 
contemporary conceptual zeitgeist. But 
on the other hand, it seems that our 
notions ought to be capable of being 
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metricized so that we can test whether 
or not the phenomena we think we 
see, occur beyond chance expectation 
and with statistical significance. 

In order to end on a positive note, 
I want to say that as a meaningful 
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measure of the nature of the uncon- 
scious nothing beats our current pro- 
jective a whatever the un- 
conscious is—if you see what I mean. 


Received December 8, 1958. 
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Variability of Responses and the Stability of Scores 
and Interpretations of Projective Protocols 


DONALD W. FiskE 
University of Chicago 


The literature on projective tests 
contains very little information on the 
consistency of responses. Very little is 
known about the stability of responses 
to the same stimuli at different times 
and almost nothing is known about 
the stability of interpretations of such 
different protocols. 

This paper is concerned pe 
with these problems. We shall see that 
responses to the same stimuli tend to 
be different on different occasions and 
that such instability of response seems 
to have serious effects on the stability ) 
of interpretations. On the other hand, 
some scores or ratings derived from 
successive protocols show fair stability. 

From work on the variability of a 
person’s behavior over time, I have 
come to the postulate that behavior 
in the absence of external stimulation 
is completely variable. The more 
structured the stimulation and the 
more intense the stimulation (up to 
a point), the more consistent the be- 
havior. Stimuli exercise constraints on 
behavior (7) . 

This position is strongly supported 
by several kinds of evidence. First let 
us consider dreams as an example of 
“behavior” of a sort, behavior in the 
absence of stimulation. 

In the systematic work of Kleitman, 
Dement, Wolpert, and Kamiya (4), 
all the dreams a person has during a 
night are recorded immediately after 
they occur. For a number of subjects, 
the dreams of several nights have been 
recorded. Inspection of these shows 
no exact repetitions. The content of 
each dream is essentially unique al- 
though one can, of course, find in- 
stances of similar themes or the de- 
velopment of a theme, especially in 
contiguous dreams. Much of the re- 
peating material is associated with 


the experimental situation!. Thus one 
subject had several dreams, usually 
the first each night, about the situa- 
tion, the electrodes, etc. Another had 
two dreams about people slanting or 
biasing the dream material they re- 
ported. It is worth noting that the 
general topic of these reported dreams 
is common across subjects and is 
largely determined by the general ex- 
perimental situation. 


Another systematic collection of 
rotocols in the relative absence of 
stimulation is in the work of Recht- 
schaffen and Mednick (20) on the 
autokinetic word technique, in which 
it is suggested that the xed point of 
light will trace letters, words, or sen- 
tences. Here also, there is no exact 
repetition of content.? Reports from 
stimulus deprivation studies seem to 
be consistent with these data. In a 
study by Goldberger and Holt (10), 
subjects’ verbalizations during an eight 
hour period of marked stimulus re- 
duction were recorded. Inspection of 
these protocols, and of post-session 
interview reports of experiences dur- 
ing the experimental period showed 
minimal repetition of content.’ There 
were two types of exceptions: two sub- 
jects repeated the same tune twice; 
some subjects, especially those who 
were anxious to escape the situation, 
repeated content such as wondering 
what time it was and wondering how 
he looked as he lay there. The latter 
type of repeated material related to 
the situation is strikingly consistent 





+I am indebted to Drs. William Dement and 
Joe Kamiya for access to these dream pro- 
tocols. 

*I am indebted to Dr. Allan Rechtschaffen 
for access to these protocols. 

*I am indebted to Drs. Robert R. Holt and 
L. Goldberger for access to these protocols. 
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with the repeated dream material 
noted above. 

' When relatively unstructured stim- 
‘uli are present, behavior is more con- 
sistent than in the absence of stimu- 
‘lation, but it is still highly variable. 
One implication of this viewpoint is 
that a person has a very wide distri- 
bution of potential responses, the 
width or range of the distribution be- 
ing greater when the stimuli are less 
constraining. Since projective tech- 
niques are rather unstructured, the 
stimuli and instructions impose rela- 
tively little constraint on responses. 

Thus Bert Kaplan and his col- 
leagues have found that on a second 
testing, normal people can give al- 
most aS many new responses to the 
Rorschach cards as they gave the first 
time (28). In a study by Willis (29), 
subjects gave an average of four or 
five different plots (themes) toa TAT 
card in 10 minutes. I have found that 
subjects can give several responses to 
each sentence completion stem. 

While this evidence is not defini- 
tive, I am sure most of you would 
agree with the notion that a projec- 
tive stimulus at any point in time for 
a single person has several poten- 
tial responses, with varying response 
strengths. Now what is the evidence 
on the variability of emitted responses 
over time? 

For word association, Willis and I 
have some preliminary data indicat- 
ing that almost one-third to one-half 
of the responses are changed when the 
same stimulus word is repeated in a 
single session or in a second session. 
(We asked subjects to give two associ- 
ations to each stimulus. Fifty to 85 
per cent of the second associations 
are changed.) 

For sentence completion responses, 
three studies show that 64% to 80% 
of the responses are changed from 
one trial to the next (8, 18, 19). In 
one study, subjects averaged six or 
seven different responses over nine 
trials. 

The TAT does not permit similar 
comparisons. However, I have exam- 
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ined some sets of protocols, each set 
consisting of three trials 1 to 30 
months apart. Rather than estimate 
similarity, I counted the number of 
times that the three stories the per- 
son told to a card were quite differ- 
ent, one from the other, in manifest 
content. This occurred in about 20% 
of the subject-card combinations. In 
the remaining 80%, the similarity 
ranged from two stories having one 
detail in common, which did not ap- 
pear in the third, to sets of three al- 
most identical stories. Some of the 
repetition was obviously straight re- 
call, as one subject made explicit who 
“saw” each time in the blank card a 
polar bear in a snowstorm! 

Swift (24) reports that about 50% 
of Rorschach responses by 4 and 5 
year old children are repeated 2 to 4 
weeks later. Siipola, Kuhns, and Tay- 
lor (22) indicate that for presenta- 
tions of parts of these blots, about 
half the responses were changed on 
retest. Of these changed responses, 
about half were similar on the two 
occasions. 

On the Blacky Test, Granick and 
Scheflen (11) have reported that 
judges considered that about 57% of 
the thematic content of two repeti- 
tions was similar, but a comparison 
with a matched group yielded one- 
third similarity between responses 
from different children! 

For the Picture Arrangement Test, 
Tomkins and Miner report that 33% 
of the order responses were changed 
over a three week interval and 45% 
were changed over three years (26, 
p. 37). 

The figures we have given are over- 
estimates of response stability because 
there is much evidence that subjects 
remember many of their projective 
— over several weeks (cf. Swift, 
4). 

The reader may be thinking that 
while the manifest content may be 
changed, the psychological meaning 
would be essentially the same. To 
check this, Van Buskirk estimated the 
changes in psychological meaning over 
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the nine trials of the Sentence Com- 
pletion data mentioned earlier. He 
found a high correlation (.80) be- 
tween the extent of change in the psy- 
chological meaning of a person’s re- 
sponses and the extent of change in 
the manifest responses. But this was 
merely suggestive. So we compared 
the interpretations of different a 
cols from the same individual (9) . 

We asked this specific question: 
are the differences between the inter- 
pretations of protocols from the same 
person less than the differences be- 
tween interpretations of protocols of 
different persons? Thus inter-individu- 
al variation, variation between indi- 
viduals, provides a frame of reference 
for evaluating the extent of intra- 
individual variability, variation or in- 
consistency within a person over time. 

We found that in 25% of the judge- 
protocol combinations, the interpre- 
tation was more similar to interpre- 
tations of protocols from different sub- 
jects than to interpretations of other 
protocols from the same subject. This 
25% misclassification was not due to 
the homogeneity of the subjects. The 
manifest content differentiated the 
subjects better than the judges’ inter- 
pretations. 

Once again, such a single study is 
not definitive. It should be repeated 
with other tests and with complete 
batteries. It does, however, suggest 
that the changes in manifest content 
on at least one projective test make 
the interpretations clearly unstable 
over time. 

I hope that the material I have pre- 
sented has indicated to some of you 
that the existence of considerable var- 
iability over time in projective re- 
sponses presents a serious difficulty 
for clinical diagnosis and for person- 
ality assessment as a whole. But the 
picture is not entirely negative: there 
is considerable evidence that some 
characteristics of responses are reason- 
ably stable over time. 

For example, for the Incomplete 
Sentence Blank, Arnold and Walter 
(1) found the Rotter Adjustment 
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Score to have a stability over a week 
of .82. Churchhill and Crandall (3) re- 
port coefficients of .70 to .38 for 6 
month to 3 year intervals. Rohde (21) 
found stability coefficients of .68 and 
.64 (over an eight month period) for 
need variables rated on her Sentence 
Completion Test. The cathections of 
objects were even more stable: less 
than 4% were changed. For the TAT, 
Tomkins (25, p. 6) reports r’s from 
.50 to .80 over two to ten months for 
ratings using a need-press schema. For 
a two-month interval, Lindzey and 
Herman (16) report 17 1's ranging 
from .00 to .94. For a similar period, 
Helen Koch‘ has obtained a compar- 
able range of coefficients on the CAT. 
In the Granick and Scheflen study 
(11) of the Blacky cited earlier, 75% 
of the subjects were consistent in their 
liking or disliking of each Blacky 
card. For the Sargent Insight Test, 
Fassett (6) reports several coefficients 
all below .46. Rorschach scores have 
stabilities from .98 to —.16 (5, 12, 13, 
24) . In some unpublished Draw-a-Per- 
son test data of Shanan* the correla- 
tion between total area used for two 
figures over a six month interval is 
only .44. This is similar to the only 
correlations reported in Swensen’s re- 
view of the literature on the test (23). 
The evidence for the stability of char- 
acteristics of such drawings is not im- 
pressive, but Wagner and Schubert 
(27) cited by Swenson (23, p. 435), 
report a coefficient of .86 for the sta- 
bility of ratings of overall quality. 


Many of these coefficients indicate 
that the scores involved are sufficient- 
ly stable over time to permit compari- 
sons between groups. However, few 
are high enough for dependable use 
in differentiating individuals. There 
is a possibility that certain tempera- 
mental and expressive attributes have 
higher stability: for example, produc- 
tivity as measured by sa’ steven 
Rechtschaffen* has obtained an r of 
-72 for the autokinetic procedure and 
Koch‘ found an r of .78 for the CAT. 





* Personal communication. 
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But such characteristics are not 
unique to projective techniques and 
are certainly not the kind of variable 
which such techniques are designed 
to assess. 


What does the total picture look 
like? 

1. The actual content of responses 
| to projective tests varies consid- 

erably from trial to trial, and 
there are large individual differ- 
ences in the extent of this change 
in content. 

2. The stability of some scores and 
ratings based on projective pro- 
tocols is reasonably adequate. 

8. There is reason to believe that 
changes of responses may have 
serious effects on the stability of 
interpretations of total protocols. 

The problem is not that successive 
protocols are psychologically incon- 
sistent but that they often comple- 
ment each other; material in one may 
not be present in the next, The prob- 
lem then becomes that of determining 
the significance of such material for 
the enduring personality. 

One promising approach has been 
neglected, at least in research litera- 
;ture. This is Murray’s suggestion (17, 
|p. 604-5) of a unity-thema: a central 
theme; compounded from dominant 
needs and press, which can be seen to 
run through a person’s life. 

We need to investigate this notion. 
Does material from different sources 
—such as TAT and other thematic 
protocols indicate the same central 
theme? And do protocols gathered at 
different times embody the same 
thema? 

We can safely assume that the “per- 
sonality” we wish to assess and meas- 
ure is that part of the person’s dis 
sitions and response tendencies which 
is stable, at least over weeks if not 
years. Therefore it is essential that we 
develop methods to distinguish be- 
tween stable and temporary disposi- 
tions, between pre-eminent and sub- 
ordinate response tendencies. I have 
found few explicit discussions of this 
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problem oS (21, p. 109) ; Lind- 
zey (15); mae (2); Klopfer and 
Spiegelman (14) and no development 
of adequate methods. It is my convic- 
tion that the validity of projective 
scores and interpretations will con- 
tinue to be unsatisfactory until such 
methods are available. 
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Structured and Projective Tests: Some Common Problems 
in Validation 


Pau E. MEEHL 
University of Minnesota 


Some of us at Minnesota have been predictive validity, (e.g. prognosis) 
trying to re-think, as objectively as although even here I would agree 
human frailty permits, the principles with Loevinger’s recent emphasis that 
involved in the clinical use of that if predictive validity is to be general- 
structured test with which we are izable over different clinical popula- 


identified. tions, good construct validity is neces- 
In preparing for this symposium I Sary (6) . 
discovered that most of the consider- { It follows that there is very little 


ations which arise in validating such } point in carrying out validation stud. 
devices as the MMPI apply with very | ies unless the non-test indicators are 
little modification to those open-end- \superior, both qualitatively and quan- 
ed tasks which have come to be known |titatively, to the kinds of judgment, 
as the projective techniques. Faced rating, or classification which are rou- 
with a choice between rigorously de- tinely available. If we want the Ror- 
veloping a point or two, and offering schach or the Bender or the MMPI to 
a more extensive list of suggestions in play a role analogous to what the 
a brief and dogmatic form with only EKG or gold-colloid tests play in or- 
a sketch of justification underlying ganic medicine, we may have to adopt 
each, I have preferred the latter, a causal and statistical model more 


In terms of the recommendations of like that of organic medicine. We 
the APA committee on test standards, have nothing strictly analogous to the 
the great majority of validation prob- _ Pathologist’s report (a face-valid “cri- 
lems posed by both structured and / terion ); therefore validation studies 
projective tests are problems of con- | Must employ multiple assessments of 
struct validity. Careful scrutiny of ap- | the same construct, numerous internal 
parent cases of content or concurrent / Checks (sufficiently patterned but pre 
validity will usually reveal that these} served from contamination) , and some 
rubrics are only superficially appro-| Procedure for further checks on 
priate. The most obvious example o whether an apparent test miss is actu- 
this is in the use of tests as an aid inj lly a miss or only seems to be so. For 
formal diagnosis, where there is no example: We study the subset of cases 
point in giving a test in order to pre- in which the Rorschach strongly indi- 
dict the verbal behavior of the psychi- ated a schizophrenic structure but 
atric staff, Think of the so-called Which were, by the usual concurrent 
“hits” and “misses” in a validation Validity criterion, not so diagnosed; 
study employing routine psychiatric then we might determine whether Q 
diagnosis as criterion. Almost the sole sorts by the residents on each case 
value of such investigations is their tended to show a higher Q-correlation 
relevance in deciding whether the test with an idealized Q-sort description of 
can be used as a probabilistic indi- “Ps¢udoneurotic schizophrenia” than 
cator, along with psychiatric judg- 4s observed in randomly chosen non- 
ment or any other “criterion,” of the schizophrenic cases. 
presence of that inner state which is Another approach with which we 
theoretically definitive of a particular are currently working in our Multi- 
taxonomic entity. An important min- phasic studies utilizes “temporal con- 
ority of exceptions are those of truly vergence” to the psychometrics. That 
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is, recognizing that neither the test nor 
the non-test indicators (I prefer the 
term indicator to the term “criterion” 
in such contexts) is infallible, one can 
at least study the extent to which the 
non-test indicators and judgments 
tend to converge to the characteriza- 
tion yielded by the test as more non- 
test information is accumulated. Re- 
membering the attenuation formula, 
the demonstration that a set of rela- 
tively unreliable clinical indicators 
converge over time (as a function of 
repeated and qualitatively diverse ex- 
posure to the patient) toward the per- 


ception of the patient provided by the ' 


test is encouraging, even when the ab- 
solute magnitude of the asymptote of 
this convergence function is mediocre. 

A perennial problem in the valida- ; 
tion of projective devices (which they 
share with structured devices when the 
latter are multi-variate) is the exact 
role that should be given in validation 
studies to the so-called “global” ap- 
proach. While the method of correct 








matchings has been urged as the solu- 
tion of this problem, and while I 
would readily agree that it has a defi- | 
nite place (especially in the early 
stages) , I do not believe it can be de- 
fended as the method of choice. It does 
not present a very true-to-life valida- 
tion context, since we are practically 
never concerned clinically to identify 
“who somebody is.” The method does 
not reveal which variables are doing 
the work; a device which purports to 
tell us several different kinds of things 
may appear better or worse than it ac- 
tually is on the basis of a small num-: 
ber of crude and clinically uninterest- 
ing dimensions to which matchers 
have learned to attend selectively. Our 
global thinking must not mislead us 
into forgetting that in the training of 
students we do teach them to pay at- 
tention to certain scores to which the 
test gives rise and to attach differen- 
tial interpretive significance to each. 
That this is done contextually i.e. with 
reference to the pattern of the remain- 
ing scores, does not suffice to justify 
the matching method, although it does 


\ 
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demonstrate the inappropriateness of 
confining a statistical analysis to the 
calculation of Pearson correlations be- 
tween single-test dimensions and sin- 
gle-criterion variables. My own opin- 
ion is that Q-sort provides both quan- 
tifiability and globality, and is there- 
fore preferable both to the traditional 
single-variable correlation on the one 
hand and the method of correct match- 
ings on the other. There are unsolved 
problems with Q-technique, but our 
MMPI research has pretty well con- 
vinced us that for many clinical vali- 
dation purposes it is the best method 
pose 9 I think it unfortunate that 
the method detailed by Cronbach in 
1948 has not been more widely used, 
although the number of judges re- 
quired has no doubt militated against 
its application (1). 

It would be nice to have some sta- 
tistical gimmick for measuring the 
similarity of two complex characteriza- 
tions of one person’s dynamics and 
structure. This is not achieved by the 
method of matching, by single-trait 
correlations, by Q-correlations, nor, so 
far as I’m aware, by any method yet 
proposed. If I describe a person by the 
usual mixture of genotypic and phe- 
notypic statements, the dynamic — 
of view sees each facet of the pheno- 
type as an indicator of the genotype; 
and it is in some sense a more serious 
mistake to utter the single proposition, 
“This patient has rigid reaction for- 
mations against his unusually strong 
unconscious hostile impulses” (if that 
statement is in fact false) , than it is to 
include erroneously a frequently pres- 
ent indicator, e.g., “The patient will 
be docile in accepting interpretations 
[because he will perceive disagreement 
as an aggressive act]” If the characteri- 
zation were presented in numerical 
form as estimates of the patient's co- 
ordinates in the variables of the indi- 
cator or surface trait space, and similar 
estimates in the underlying causal or 
factor space, one of the available dis- 
tance measures might be Te 
(2) . Short of that idealized situation, 
however, one has the feeling that any 
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numerical measure of the “closeness of 
fit” between two structural-dynamic 
characterizations would be excessively 
arbitrary. 

I cannot refrain from a critical com- 
ment here with regard to the globality 
problem. Some clinicians are still con- 
fused between the complex psycholog- 
ical processes of the test-interpreter in 
arriving at his final integrated charac- 
terization of the patient, and the sta- 
tistical problem of validating that in- 
terpretation. After he has weighed the 
various possibilities and taken account 
of the higher order interactions as best 
he can, the clinician does produce a 
set of utterances which are, taken sin- 
gly, accurate in varying degrees. It is 
quite appropriate to determine the 
number or average amount of errors 
in the final description by considering 
the statements individually. Such a 
procedure, which studies the final re- 
sult of the clinician’s cerebrations, 
seems to me to be an unavoidable part 
of validation. This methodological 
point was made crystal clear by Kelly 
and Fiske (4) in their exposition of 
the Michigan assessment program’s de- 
sign, but in spite of the lack of any 
rational reply to their arguments the 
principle has even yet not completely 
sunk in. 

A tough practical problem in the 
validation of multivariate instruments 
is that of utilizing higher order inter- 
action effects among the test variables. 
The qualitative merits of the inde- 
pendent construct variable estimates 
should be excellent, as I’ve pointed 
out above, otherwise we are wasting 
our time, This usually means that the 
patients have been subjected to in- 
tense scrutiny by judges of a high level 
of clinical competence. On the other 
hand, the capitalization of random 
sampling errors increases horrendous- 
ly as patterning effects are taken into 
consideration, so that the determina- 
tion of a function and the assignment 
of optimal weights requires a large 
sample. In other “ene of applied 
psychology or even in clinical psychol- 
ogy where predictive validity is really 
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the problem, it often suffices to know 
a very little bit about a large number 
of persons; for other purposes, as in 
carrying out individual psycho-ther- 
apy, it suffices to know a very great deal 
at a deep level about a single person. 
In validating the configural indicators 
of a multivariate test, we find ourselves 
in the unhappy position of requiring 
to know a great deal about a large 
number of persons. 

I come now to some considerations 
relating validation studies to clinical 
practice. Since the pone may agp of 
multivariate devices involves the test- 
clinician combination (i.e., mechani- 
cal or cookbook procedures do not ex- 
ist), at what level of clinical skill 
should the clinicians in validation 
studies be chosen? One can put several 
questions, each of which makes theo- 
retical and practical sense. We cannot 
dismiss negative validation studies on 
the grounds that the Rorschachs used 
were not being read by Klopfer but by 
“ordinary” clinicians of only moder- 
ate skill. One empirical property of a 
clinical device is how much valid in- 
formation is extracted from it by per- 
sons who have achieved that degree of 
proficiency typical of the population 
of users. Suppose that only 5% of peo- 
~ who are ABEPP diplomates, who 

ave attended Rorschach workshops 
and have administered 300 or more 
Rorschachs under supervision, can ar- 
give at clinically useful conclusions 
with the instrument; the taxpayer is 
being somewhat defrauded if the other 
95% draw part of their salary on this 
@basis. A second question is, “How 
much valid and clinically useful infor- 
mation can be extracted by the most 
skillful interpreters of an instrument?” 
Finally, one may ask, “How much in- 
formation is intrinsically contained in 
the test behavior?” We are beginning 
to collect some evidence with regard to 
the MMPI that nobody gets all of the 
information out (3, 7, 8) and I would 
assume that to be equally true of less 
structured devices. 

“Clinically useful information” is 

actually a rather complicated notion. 
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The weakest claim that can be made 
for a test is that it tells us things about 
the patient which are true. It is a re-® 
markable social phenomenon that, 
within an allegedly scientific profes- 
sion, even this claim has not as yet 
been clearly supported with respect 
to the majority of the inferential uses 
to which our favorite devices are being 
put. Secondly, one may ask, “How 
much valid information does the in- 
strument yield which is not readily 
available from other sources present 
in any clinical context and who will* 
routinely be making these judgments 
regardless of whether we use the test?” 
A third question is, “How much soon- 
er in the time sequence does such in- 
cremental information become avail-, 
able to us through the use of the test?” 
If my test tells me something about the 
patient which I do not get from the 
initial diagnostic interview but which 
I will be able to assess with a relatively 
high 'degree of probability by the end 
of the fifth therapeutic hour, the use 
of the test is hard to justify. 
Finally—the really important ques- 
tion—““What is the incremental fi 
ciency in clinical handling (that is, 
both, in gross administrative decision 
making and in the moment-to-moment 
decisions made in the course of psy, 
chotherapy) achieved through incre- 
mental information provided in ad- 
vance by the test?” The difficulties of ' 
designing experiments to answer this 
fourth question are considerable, but 
it seems to me obvious that the ulti- 
mate justification for the clinical use 
of devices, projective or otherwise, 
must lie here. I am myself inclined to 
be somewhat skeptical in this regard, 
and it is a source of continual amaze- 
ment to me the extent to which many 
clinicians take the answer for granted. 
In concluding, I cannot miss the op- 
portunity to attack an unfortunate 
tendency, springing no doubt from 
over-identification with our favorite 
devices (combined with a lurking fear 
that they may not be quite as valid as 
“everybody knows” they are) , to reject 
any experimental or statistical design 
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which is offered, on the ground that it 
is artificial or not in accord with ac- 
tual clinical practice. True, some un- 
imaginative designs have been em- 
ployed to produce negative results 
when a more sophisticated approach 
would have elicited the validities ac- 
tually present. But I think it is time 
for some blunt speaking within our 
profession about the disparity which 
exists between even the best available 
studies of validity and the amount of 
faith—and I use here a theological 
word deliberately—invested by a large 
number of practitioners in current 
methods. Studies such as that of Kost- 
lan (5) in which the incremental in- 
formation provided by each of several 
instruments is estimated by showing 
the effect of subtracting that source 
from the total pattern of data avail- 
able, are sufficiently close to “real life” 
clinical practice that no one is ethical- 
ly and scientifically entitled to ignore 
the findings. There is far too great a 
tendency to invoke “clinical expe- 
rience” as though it were a satisfactory 
answer to the validation question. 
Such a mistake could only be made by 
someone ignorant of the history of 
science (particularly of the medical 
sciences) or remarkably naive about 
those errors in observation, memory, 
and interpretation to which the hu- 
man intellect is subject. 
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A Study in Validation of the Thematic Apperception Test’ 


WILLIAM E. HENRY 
University of Chicago 
JANE FARLEY 
Parmly Foundation for Auditory Research, Illinois Institute of Technology 


The study reported is based upon an 
analysis of variance design which eval- 
uates the degree of correctness with 
which TAT interpretations can be 
identified with the correct TAT sub- 
jects, in randomized matching. “Blind” 
interpretations of TAT’s administered 
to a group of 36 14-15 year old adol- 
escents are systematically compared 
with similarly “blindly” derived sum- 
maries of selected criterion data (ob- 
jective test scores, summaries of obser- 
vations and interviews, and Rorschach 
interpretations) . All instruments rep- 
resented in the criterion data had been 
summarized independently, but each 
summary was patterned on the same 
conceptual framework, agreed upon in 
advance by each of the clinicians in- 
terpreting the various instruments. 

The present study, using the data 
described, follows a model proposed by 
Cronbach (1). The 36 adolescent sub- 
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jects were divided into 12 “triads.” 
Criterion material for each triad was 
separated into statements, and grouped 
into four general concept areas (Peer 
Relations, Family, Mental Function- 
ing, and Emotional Adjustment) . Ex- 
pert judges were then asked to match 
individual TAT interpretations 
against a series of statement sheets in 
which criterion statements from three 
subjects were randomly represented, 
giving a “t”, a “0”, or a “?” rating to 
each comparison. Modifications of the 
Cronbach design permit the derivation 
of statements assessing the influence of 
various effects of the judging proced- 
ure, to include the effect of the indivi- 
dual case being judged, the order of 
the presentation of cases, the effect of 
the particular criterion instrument be- 
ing used, the sequence of cases judged, 
the behavior area being judged, and 
others. 

Since the judging operation com- 
prises two tasks — correct selection of 
matching items, and correct rejec- 
tion of nonmatching items — the cor- 
rectness “scores” of judges were sim- 
ilarly divided. Judges’ scores were 
treated separately, as reflecting on the 
ability of the TAT to identify descrip- 
tions of a particular individual, or on 
the sensitivity of the TAT in discrim- 
inating between individuals. The an- 
alysis of variance was undertaken on 
both identification matchings (I- 
scores) and upon discrimination 
matchings (D-scores) . 

In all instances, hypotheses related 
to the validity of the original blind 
TAT interpretations are demonstrably 
true at a level of confidence very con- 
siderably beyond the .01 level. This 
level of confidence is based on the F- 
distribution, and makes use of a spe- 
cial index developed by Dr. David 
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Wallace of the Department of Statis- 
tics, University of Chicago. The Wal- 
lace Index is used in computing each 
correctness score which is entered in 
the F-equation, in such a way that the 
resulting F-value is the percentage by 
which the correctness of that score ex- 
ceeds chance. The high level of suc- 
cess thus obtained holds for both the 
I-scores and the D-scores. Certain dif- 
ferentiations between these scores will 
be discussed below. 


POPULATION 


The 36 cases involved were adoles- 
cents of approximately 14 years of age, 
all from the same school in a midwest- 
ern community of under 10,000 pop- 
ulation. All were “normal,” in that 
they were going to school, living in 
their own homes. Half of the subjects 
were girls, half boys. 


TAT INTERPRETATIONS 


The basic material to be judged con- 
sisted of individual TAT interpreta- 
tions made by one person upon the 
records of these 36 adolescents, Inter- 
pretations were made blind for each 
subject, with the exception of knowl- 
edge of sex and age. 


CRITERION SUMMARIES 


The basic data against which the 
TAT interpretations were judged for 
fit were derived from instruments rep- 
resenting three general types, or levels, 
of material: 

1. Objective Behavior—Sociometric 
and Guess-Who Tests, Family Rela- 
tions Questionnaire, Stanford-Binet, 
academic achievement tests, school 
grades, and the Emotional Response 
Test of Bavelas. 

2. Subjective Behavior — derived 
from observations and interviews with 
teachers, parents, peers in home and in 
school. 

3. Projective Behavior — the Ror- 
schach. 


CONCEPTUAL FRAMEWORK 
The framework of concepts used was 
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identical for each criterion instrument, 
and for the TAT (2). It included two 
gross areas of personality, Social Inter- 
action, and Characteristics of Self. The 
Social Interaction elements include 
peer relations and family relations. 
The Characteristics of Self elements 
include mental functioning, emotional 
reactivity, inner adjustment, the de- 
fense mechanisms, and so on. In all 
major areas, with their various sub- 
divisions, the personality analysis of 
each instrument was divided still fur- 
ther into 27 statement areas. In the 
final materials for judging, each state- 
ment area was directly comparable. 
Thus statement No. 1 from any one 
of the criterion instruments always 
deals with the “Degree of participa- 
tion in Peer Groups” (in the Social 
Interaction area), statement No. 16 was 
always material related to “Basic Emo- 
tional Attitude,” (from the Self area) , 
etc. 

The 27 statement areas were grouped 
on the statement sheets according to 
four larger categories—I. Peer Rela- 
tions, II. Family Dynamics, III. Mental 
Functioning, IV. Emotional Adjust- 
ment. 


ASSIGNMENT OF CASES TO TRIADS 


The 18 male and 18 female cases 
were then randomly assigned to triads, 
and the position of each subject within 
the triads similarly randomly assigned. 


THE JUDGES AND THE JUDGING 
PROCEDURE 


Nine psychologists, all experienced 
clinicians and projective-analysts, were 
chosen as judges. For these nine judges, 
three different presentation patterns 
were designed, with three judges (a 
judge “‘trio’’) receiving each of the pat- 
terns. In this way, replication within 
trios makes possible a check on the re- 
liability of judging, and each TAT 
summary is matched with each criter- 


ion equally often. The presentation - 


orders were balanced, to avoid or to 
detect practice effects, or any inter- 
actional effects from the sequence of 
criterion type, sequence of individual 
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subjects in a given triad, and so on. 
Scoring instructions for the judges 
were standardized, and each judge’s 
ratings were made on separate scoring 
forms. 


ANALYSIS OF DATA 


We have undertaken two largely in- 
dependent analyses of variance aimed 
at the analysis of the judgments of 
Errors of Identification (I-scores) and 
at the Errors of Discrimination (D- 
scores) . Within each, we have analyzed 
the contribution made by some of the 
various effects of the judging process. 


RESULTS 


Since the details of the statistical an- 
alysis will be reported in a subsequent 
monograph, we will limit ourselves to 
a mere statement of our conclusions. 

1. The Validity of the TAT, We 
selected, as the evidence for a general 
validity hypothesis, I-scores which 
showed a greater than chance rela- 
tionship between TAT statements and 
criteria, at a .01 level of confidence. 
All of the I-score blocks show signifi- 
cant agreement considerably beyond 
this .01 level. There was no significant 
variation between the types of cri- 
terion instrument (objective tests, in- 
terviews, Rorschach) in the agreement 
of each with the TAT predictions. Al- 
though various effects of behavior 
area, presentation order and sequence, 
individual subjects, and their various 
interactions, were for the most part 
highly significant in their influence 
upon the scores, even the lowest agree- 
ment scores were far above chance in 
every case. None of the effects or inter- 
actions tested depressed the scores of 
any block below the .01 significance 
level. Similarly, despite a rather sur- 
prising lack of conformity among the 
judgements of these expert judges, all 
judges’ scores matched the correct “I” 
ratings at or above the .01 level. 

2. The Discrimination Power of the 
TAT. We find an hypothesis about the 
usefulness of the TAT in expressing 
the uniqueness of individual person- 
ality unequivocally supported by the 
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high Discrimination (“D’) scores ob- 
tained. These scores, like the Identi- 
fication scores, are significant consider- 
ably beyond the .01 level. 


3. The Scope of the Concepts Rele- 
vant for the TAT. There is no area of 
concepts, no criterion instrument, no 
presentation effect which reduces the 
significance level of the correctness of 
TAT-criterion matchings. Thus we 
can claim for the TAT, within the lim- 
its of the four broad and 27 detailed 
concept areas, and within the popula- 
tion dealt with, the wide scope implied 
by the generality of our hypotheses. 
However, some differentiations within 
the various effects of judging were 
found. By far the greatest single effect 
on the judging process relates to dif- 
ferences in the behavior area in which 
the prediction was made (Peer Group, 
Family Dynamics, Mental Function- 
ing, Emotional Adjustment) . This ef- 
fect is entirely in keeping with expec- 
tation. The reason commonly given for 
assessing personality by a battery of 
tests, rather than by a single instru- 
ment, is that the tests will be focussed 
upon different areas of personality 
function. Scores in the four areas men- 
tioned divide themselves neatly into 
two groups—a Peer Group-Emotional 
Adjustment ee and a Family 
Dynamics-Mental Functioning cate- 
gory — with the two sections in each 
group showing virtually the same per- 
— of agreement above chance, 
and the former group noticeably more 
accurately judged than the latter. It 
should be noted that the lower effi- 
ciency of ap gee in these latter 
areas is only a relative matter, since 
agreement was significantly beyond 
chance for all areas. There are several 
possible explanations of this differen- 
tial agreement. It may reflect a true 
differential ability of the TAT. It may 
reflect the particular preoccupations of 
the life stage of our population. It may 
reflect the particular foci of interest of 
the TAT interpreter. 

4. The “Fatigue” Effect. Between 
the first, second, and third sections of 
the total matchings undertaken by 
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each judge, there is a consistent, 
though not extraordinarily large, de- 
cline in performance. This applies to 
both I and D scores, and to all pre- 
sentation orders, and seems to repre- 
sent a simple function of fatigue. 


5. Conclusions about the Clinical 
Use of the TAT. 


a. There is no single correct way of 
employing the TAT interpretations. 
There was little individual-item- 
agreement between judges, but each 
judge made enough “correct” judg- 
ments to yield the highly significant 
agreement figures reported. Judges 
may arrive at essentially the same in- 
terpretive implications by quite dif- 
ferent routes; or judges may differ in- 
dividually in their ability to utilize 
TAT predictions in different concept 
areas, or for different subjects. But 
whatever the route used, all judges ar- 
rived at a substantially accurate un- 
derstanding of each subject, despite 
great inter-judge variability. 


b. Judges appear to make more effi- 
cient use of the projective (TAT) re- 
port when the decisions they must 
make are definitive and detailed, than 
when they are relatively simple; i.e., 
because there is overlap in the be- 
havioral repertoires of individuals, it 
is easy to make the mistake of includ- 
ing in a subject’s potential responses 
those which are more characteristic of 
someone else. It is more difficult (and 
more inaccurate) to exclude from a 
subject’s potential responses those he 
in fact displays. Conversely, it is less 
difficult to make the second type of 
judgment correctly than the first. Al- 
though there were more mistakes of 
the “D” type, as expected, the judges 
also varied much more widely in their 
ability to make the “I” type of judg- 
ment accurately. We have concluded 
that the “set” necessary for making fine 
distinctions (concentration, delibera- 
tion, and impersonal detached judg- 
ment) are more conducive to evenly 
accurate diagnosis than the more spon- 
taneous, personally colored approach 
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used in making the less tedious comjstat 
parisons. firm 

c. The judges’ perceptions wer§ ° 
sharpened by consideration of a variet}'ht' 
of personality data, but there is a poing"0r 
of diminishing returns. Although 4¢*P 
certain amount of facilitation in judg}!"5! 
ment is produced by examining thq™ifi 
subject from several angles, there igPS 
also a “fatigue” effect produced by ( 
superfluity of information, and this effin | 
fect becomes dominant when the clinj por 
ician considers many subjects. ject 


d. Clinicians have their particulag° ' 
‘specialties.’ The accuracy of match} | 
ing differed with each judge, from inj” 
dividual to individual. There was, at all 
the same time, no clear agreement be} _ || 
tween judges as to which individuals ¢™' 
were judged more accurately, Thej‘° 
exact cause of the divergence is not be) 
clear, but the crucial variable is ap4P'¢ 
parently strictly personal. That is, it}5¢4 
is explained by a difference in the way} Sh¢ 
a judge perceives certain subjects or| th 
groups of subjects. It may be that the} fur 
judge is especially expert at recogniz-|™¢ 
ing all hysterics; another judge bas 
matches boys better than girls; a third] 
simply finds certain individuals more] be 
interesting than others, and is thus} be. 
more involved with those individuals.} th¢ 
In any case, each judge fluctuated in} be 
his ability to judge different individ-} ># 
uals, and no two judges fluctuated inj ™4 
exactly the same direction. inf 


e. Instrument. One negative finding 
is fully as important as the positive re- 
sults of any other of the tested varia-| 4" 
bles; this is the nonsignificance of the} |}, 
differences between criterion instru-] | 
ments, and of the instrument-area in- b 

° * u 
teraction. It was confidently hoped 
that the results of this validation study (q 
would provide some clue to the place 
of the TAT in a clinical test battery § 
for personality assessment. Since three 4 
different types of instrument were in- | ** 
cluded among the criteria against fl 
which the TAT was matched, there § *™ 
was reason to believe that the TAT 
would show itself to be closer to one 
than another of these techniques. In 
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s comjstating the evidence acceptable for con- 
firmation of the validity hypothesis, 
- wera “standard” was selected from the 
varietythree criteria for each area of predic- 
poingtion. This selection was made in the 
ugh geXpress anticipation of notable area- 
» judgjinstrument interaction. The nonsig- 
1 thqnificance of these effects presents three 
ere igpossible interpretations: 
iby 4 (1) There is actually no difference 
this effin level or in accuracy between the re- 
e clinjports made from the objective, sub- 
jective, and projective tests, as applied 
‘iculag ‘© the four behavioral areas. 
natch} (2) The TAT is so flexible, and so 
ym inj comprehensive that it taps all levels of 
vas, all areas equally well. 
nt be} (3) The interpreters of each of the 
idualg Criterion instruments knew their tests 

The} so well that none of them ventured 
is not} beyond the information legitimately 
is ap| Provided by those instruments; con- 
is, itpsequently, the criterion information 
e way} Should vary in richness and clarity as 
cts or| the information sought was nearer or 
at the}further from the focus of the instru- 
ogniz-| ment; but the statements would not be 
judge based on an overextension of the in- 
third] Strument, and would be less likely to 
more} be false than a forced statement might 

thus} be. In this way, judges comparing 
duals,| these statements with the TAT might 
ed in| be required to make decisions on the 
divid-| basis of superficial or meager infor- 
ed in} ™ation, or might be faced with a “no 
information” item, but would prob- 
ably not be asked to match the TAT 
with an incorrect statement. Conse- 
quently — assuming a consistent level 
of validity for TAT interpretation — 
there might be no significant change in 
error score (false plus or false zero) , 
but a great deal more uncertainty 
(question mark) . 

The first alternative (equivalence of 
tests) we can discard, since it is contra- 
dicted by all of clinical tradition and 
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well be partially true. However, the 
“one perfect” clinical test idea does 
not agree with personality theory, with 
test theory, or with the experience- 
based opinions of testers. 

The third alternative (limitation of 
the scope of interpretation in criterion 
data) —or at best a combination of the 
second and third — is probably the 
truest explanation of the unexpected 
negative finding. We have one ready- 
made index to the plausibility of this 
explanation: the hypothesized “mea- 
gerness” or “superficiality” of data are 
matters of opinion, which it would be 
difficult to verify; but the “no infor- 
mation” items are perfectly definite 
and verifiable. The distribution of no- 
information items lends credence to 
our answer. Using the same criterion 
for each area, we find the “no informa- 
tion” items distributed according to 
this pattern. The instrument-area com- 
binations chosen as ideal have the 
smallest number of N.I. entries. The 
smallest proportion of N.I.’s given for 
the observation and interview criterion 
was in the Family Relations area; the 
smallest proportion of N.I. items in 
the Emotional Adjustment areas is 
from the projective (Rorschach) cri- 
terion; the smallest proportion of ob- 
jective “no information” items is found 
in the Peer Area. The single contra- 
dictory entry in the table is the large 
number of N.I. items found for the ob- 
jective criterion in the Mental Func- 
tioning area. However, 70 per cent of 
these data failures occurred on items 
14 and 15, which concern creativity 
and fantasy, and about which the ob- 
jective data might be expected to have 

ess to say. 
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What is a Criterion?' 


EvELYN HOOKER 
University of California at Los Angeles 


I take the liberty of departing from 
the general pattern followed by most 
of my colleagues in this symposium, 
that is, concern about aspects of the 
validation process in general as applied 
to ye ago techniques. Instead, I 
shall try to raise some issues about val- 
idation within the context of a spe- 
cific clinical area, namely, homosexu- 
ality. This is done with the hope that 
the issues may be clearer if seen with 
concrete application; moreover, the 
theoretical and practical importance 
of validating projective test instru- 
ments for the diagnosis of homosexu- 
ality needs no special documentation. 


I agree with Professor Meehl that 
the majority of validation problems 
are those of construct validity and that 
careful scrutiny of apparent cases of 
concurrent validity will reveal that 
these rubrics are only superficially a 
propriate. I would extend the case still 
further, to postulate that studies which 
restrict themselves to concurrent valid- 
ity are relatively sterile. Unless our va- 
lidity studies make clear the nature of 
the psychological phenomenon being 
investigated, without depending solely 
on a diagnostic label of the behavior 
which constitutes the criterion, we are 
simply technicians — and not very 
good ones, at that. 

Let me try to make this clear by 
turning to validation studies of overt 
homosexuality. The tests which have 
been most frequently used for the 
diagnosis of homosexuality are the 
Rorschach and the TAT. The current 
validity status of the two instruments 
can be fairly quickly summarized. In 
only a bare handful of studies has an 
attempt been made to validate both of 
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these instruments by comparing overt 
homosexuals with non-homosexuals, 
using test indicators called “homosex- 
ual indices.’’ Davids, Joelson, and Mc- 
Arthur (1), using the Wheeler (9) 
homosexual-content indices in the Ror- 
schach, found that these would differ- 
entiate a group of overt homosexuals 
who were college students from a 
group of non-homosexual “normal” 
students and also from a non-homosex- 
ual neurotic group of students but 
that they would not reliably distin- 
guish individuals within these groups. 
Ferracuti and Rizzo (2), in a study 
conducted in Italy, found that the 
Wheeler indices, taken as a whole, 
would not differentiate a group of 
homosexual prisoners from non-homo- 
sexual prisoners but that the presence 
of any one sign or any two signs would 
distinguish the two groups. Nitsche, 
Robinson, and Parsons (7) , using a list 
of content categories similar to Wheel- 
er’s, found that the homosexual signs 
did not differentiate a group of “re- 
cently convicted” homosexuals from 
patients with no known homosexual 
activity. In my own work (4) , the total 
Wheeler content-indices discriminated 
between a group of overt homosexuals 
and a group of non-homosexuals only 
when the differences in matched pairs 
were taken into account. Thus, the dif- 
ferences are so small that for clinical 
— they are unimportant. Add 
to the discrepancies in the findings just 
cited the fact that, in the groups of 
records with which I worked, expert 
Rorschach workers were unable to dis- 
tinguish homosexual from heterosex- 
ual records, even when presented with 
matched protocols. In none of these 
studies has it been found that indi- 
vidual records could be validly diag- 
nosed: the criterion “overt homosexu- 
ality” obviously covered a “multitude 
of sins.”” Kinsey and his associates (5) 
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have demonstrated the fallacy of as- 
suming that a homosexual is an in- 
dividual who has engaged in a homo- 
sexual act. Unless the extent of hetero- 
sexual activity and/or homosexual ac- 
tivity, together with the pattern of de- 
sires, is made clear, the criterion be- 
comes ambiguous. 


Let us take a look at the results with 
the TAT. As far as I know, only three 
studies have been reported in the lit- 
erature. Davids, Joelson, and Mc- 
Arthur (1), in the study previously 
reported, used the TAT in addition to 
the Rorschach. Using TAT homosex- 
ual indices gathered from the litera- 
ture, they found that these, taken as 
a group, would differentiate the homo- 
sexual from the non-homosexual group 
and that the Rorschach and TAT signs 
were significantly correlated. Ferracuti 
and Rizzo (2), using a somewhat sim- 
ilar set of TAT signs, found that they 
did not distinguish the two groups. 
Lindzey, Tejessy, and Zamansky (6), 
also using college students, have found 
that the TAT indices will differentiate 
a homosexual from a non-homosexual 
group but that an uninstructed and 
unhampered clinician could do a 
much better job of differentiating than 
would the objective rater. I found that 
the TAT and MAPS protocols of the 
homosexuals previously described in 
connection with the Rorschach study 
(4) were so full of manifest homosex- 
ual content that neither the judgment 
of a clinician nor an objective rating 
of indices was necessary in order to 
distinguish the homosexual from the 
heterosexual records. The TAT and 
MAPS thus would appear to be more 
valid instruments for detecting homo- 
sexuality than is the Rorschach. But 
let us not be hasty! Lindzey, Tejessy, 
and Zamansky state (6, p. 73), “It is 
evident that the variables that function 
best are precisely those which one 
would not expect to encounter when 
dealing with strongly repressed homo- 
sexuality or with an overt homosexual 
who was attempting to conceal this 
aspect of his behavior.” I would like to 
underscore that remark heavily, not 
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only for the TAT and MAPS but, also, 
with somewhat less confidence, for the 
Rorschach. A subject who had given a 
Rorschach record which, up to this 
point, gave no indication of homosex- 
uality, produced an oral-genital re- 
sponse which he prefaced with, “I hate 
to say this, because it will give me 
away.” He had already “given himself 
away,” since he had told me he was 
homosexual although he was married. 
He also had told me that he would not 
have come to me were I living in his 
home town. He successfully conceals 
his homosexuality, and is a highly 
respected professional man in his com- 
munity. Another said, “I have heard 
that homosexuals are supposed to see 
sex in this, but I honestly can’t.” Test 
subjects these days are obviously no 
longer as naive as we like to believe. 

The situation is not very encourag- 
ing for the clinician who wishes to use 
any of these three projective instru- 
ments for the diagnosis of homosexu- 
ality. All of the investigators point out 
that the use of the signs for individual 
diagnosis is unwarranted — unless, in 
the TAT, the subject virtually tells us 
that he is a homosexual. Why not let 
him tell us directly? If all one is look- 
ing for in the projective protocol is the 
answer to the question, “Is he homo- 
sexual,” one may be using an inefhi- 
cient means of obtaining the answer. 
If the subject is set not to give the show 
away, a projective test may not help us 
much. 

Can the situation be improved? Yes, 
I think it can be, provided we begin 
to think seriously about the fact that 
studies of concurrent validity are, in 
reality, either good or poor studies of 
construct validity. Or, to put it in a 
slightly different way, we cannot hope 
to use projective techniques to answer 
the question accurately, “Who is a 
homosexual,” until we know much 
more about who or what a homosex- 
ual is. 

In the studies cited, two personality 
pictures of “the” homosexual have 
been used. To quote Wheeler (9, p. 
123) , “If it is possible to accept the ra- 
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tionale for each item in terms of the 
aforementioned results, a combined 
picture of the male ‘homosexual’ 
would be: A somewhat paranoid indi- 
vidual with derogatory attitudes to- 
ward people, especially women, which 
is accompanied by a feminine identifi- 
cation. There are indications of anal 
interests and interest in physical rela- 
tionships between like beings. There is 
apparently some preoccupation with 
sex in general and some autoerotic con- 
cern.” To quote Lindzey (6, p. 74), 
“Our tentative findings suggest that 
the TAT protocols of the individual 
characterized by strong homosexual 
tendencies display consistently nega- 
tive attitudes toward women, a lack of 
full, rich, and satisfying relations with 
members of the opposite sex, and oc- 
casional reference to manifest homo- 
sexuality.” A basic assumption which 
they have in common is that “the” 
homosexual exists and that he is char- 
acterized by attitudes consistent with 
hatred of, or at least negative attitudes 
toward, women—a relatively simple 
construct. I venture to suggest that not 
only is it too simple but it is not alto- 
gether true. Let me hasten to add that 
many homosexuals can be character- 
ized in this way; but even for them it 
is a caricature of the complexity of the 
individuals involved. At the risk of 
stating a self-evident fact, it is ——— 
to anyone with knowledge of the wide 
individual variations in homosexual 
patterns and life styles, as well as to 
anyone who has looked at projective 
test protocols or life histories, that 
homosexuality is not an entity but is, 
rather, a multi-manifested phenom- 
enon. 

It is precisely here that projective 
techniques can help us. We need to get 
beyond the fact that the individual is 
homosexual, to the kind of homosex- 
ual that he is. We are badly in need of 
fruitful theory which will take into ac- 
count the multiplicity of variables 
which appear to be operating. Pro- 
jective techniques can be valuable 
sources of constructs for such theory- 
building. It is not an easy task; and it 
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is one which, I think, will not be ac- 
complished by the development of 
“homosexual signs.” It seems unlikely 
that signs will help us to diagnose or 
understand individuals who are as 
varied as—to quote judges who have 
interpreted the projective protocols— 
“an impoverished, restricted, promis- 
cuous, compulsive homosexual, devoid 
of the capacity for deep feeling”; “a 
straight heterosexual—except in the 
bedroom”; “sensitive, independent, 
autonomous, with strong control of 
impulses and with a moral attitude to- 
ward sex”; “a itive homosexual 
who goes through life gently, with lots 
of strength and no hostility, makes 
strong relationships, and for whom 
homosexuality is a way of self-actuali- 
zation”; “a IO icono- 
clast who has libidinized all of his psy- 
chological needs in fellatio”; ‘a para- 
noid character who has never made u 

his mind whether he is a boy or a girl, 
a tormented, sad homosexual who flees 
from close relations and is at the mercy 
of his own intense emotions”; “an in- 
dividual with lots of strength as a man, 
who plays the assertive role with men, 
and with tremendous hostility to wom- 
en”; and, finally, just to bring this 
arbitrarily to a close, “a homosexual 
with pervasive feminine orientation, a 
very passionate person, an aggressive 
personality with a paranoid base.” 
The most striking finding of the three 
judges (3) who examined the projec- 
tive materials was that many of the 
homosexuals were very well adjusted. 
In fact the three judges agreed on two- 
thirds of the group as being average to 
superior in adjustment. Not only do all 
homosexuals not have strong feminine 
identification, nor are they all “some- 
what paranoid,” but, according to the 
judges, some may not be characterized 

y any demonstrable pathology! 

It will have become evident by this 
time that I am not greatly disturbed 
by the fact that projective techniques 
are not demonstrably valid means for 
diagnosing homosexuality. In fact I am 
rather encouraged by this, because I 
hope it will force us to re-examine the 
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much over-simplified picture we have 
had and encourage us to remind our- 
selves that the first goal of science is 
understanding, with prediction and 
control as secondary to it. It may not 
be inappropriate to quote here the 
opening sentence of the APA recom- 
mendations for psychological tests and 
diagnostic techniques (8, p. 1): “Psy- 
chological and educational tests are 
used in arriving at decisions which 
may have great influence on the ulti- 
mate welfare of the persons tested, and 
of the community.” There is probably 
no clinical area in which psychological 
tests are used for which that statement 
has greater import than that of homo- 
sexuality. 
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Validity and Methodology in Projective Tests 


NorMAN L. FARBEROW 
V.A. Center, Los Angeles 


The reflections in this paper stem 
from a study by Dr. Shneidman and 
me on “TAT Heroes of Suicidal and 
Non-Suicidal Subjects,” a report of 
which appeared in the June 1958 issue 
of the Journal of Projective Tech- 
niques (10). This was a study which 
started out as a Clinical investigation 
of suicidal individuals as inferred from 
their reactions to the TAT, but which 
ended up as an examination of the 
validity and methodology of the tech- 
niques used. This is not the occasion to 
go into details about that experiment, 
but in order to report these reflections, 
let me briefly review some aspects of 
the study. 

In essence, ninety-three TATs were 
gathered into seven groups which were 
then compared with each other. The 
categories consisted of three groups of 
males, normal, neurotic, and psychotic, 
who were non-suicidal; two groups of 
males, neurotic and psychotic, who had 
attempted suicide; one group of neu- 
rotic females who had attempted sui- 
cide; and one group of neurotic males 
who had completed suicide. The tech- 
nique used was one devised by Ira 
Friedman in which, instead of the 
usual procedure of rating for the at- 
tributes of the story-teller, the at- 
tributes of the TAT heroes of the 
TAT protocols were sorted (3, 4). 
These were rated—completely blind— 
by means of an 80-item, nine-step, Q- 
sort procedure by one judge, in this 
case, Dr. Friedman himself. 

It was our hope that the sorting for 
the attributes of the story-hero rather 
than for the characteristics of the story- 
teller would permit us to eliminate 
some of the harrassing problems of un- 
reliability and to come closer to grasp- 
ing some of the essential differences in 
the thematic productions of suicidal 
and maevantatiiel subjects. We do not 
wish to say that our analysis of the sta- 


tistical results were completely disap- 
pointing, but suffice it to say that on 
viewing the results one of us went into 
a sharp reactive depression. In reflect- 
ing back to our feelings at the begin- 
ning of the study, it seems that we co- 
vertly had the naive hope that the re- 
sults would be something as follows: 
that our reliabilities would be perfect; 
that the within-group correlations 
would be plus one, indicating com- 
plete homogeneity within each group: 
and that the between-groups correla- 
tions would be around zero, indicating 
that these homogeneous groups were, 
indeed, different from one another. We 
could then describe in some detail 
those clinically significant attributes 
which nature, the TAT, and statistics 
had shown to characterize each of the 
seven groups. Clinicians would then 
have new and practical clues to sharp- 
en our diagnostic acumen and to en- 
able them to distinguish among indi- 
viduals who have attempted suicide or 
who will commit suicide. Our IBM 
equipment, alas, betrayed us. It ap- 
peared that none of the seven groups 
was essentially homogenous and that 
while our judge was able to make es- 
sentially correct over-all judgments 
better than chance as to the sex of the 
subjects, and as to normal versus dis- 
turbed (but not suicidal) subjects, the 
differentiation between suicidal and 
non-suicidal status yielded no better 
than chance expectation. In thinking 
through the results we came to the con- 
clusion that the study had some inter- 
esting implications for the validity and 
methodology of picture thematic tests. 

We were faced. with the inescapable 
results that the TAT, as it had been 
employed in our study, could not dif- 
ferentiate between groups of suicidal 
and non-suicidal subjects. The possi- 
bility that there were no essential dif- 
ferences was a conclusion we could not 
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comfortably accept, having had the 
benefit of a great deal of clinical and 
empirical observation behind us from 
over a decade of intensive research 
pre-occupation with the subject. Our 
conclusion was that not only was the 
nature of the phenomenon we were 
attempting to measure so complex, but 
that the nature of the stimuli used to 
evoke such differences were either so 
varied, or even inappropriate, that the 
differences must, indeed, have been 
masked. 

In terms of the phenomenon, or bet- 
ter, the phenomena, of suicide, we felt 
we were under no misconceptions as 
to the complexity of the subject we 
were studying. To paraphrase Dr. 
Hooker, it is extremely difficult to! 
know who is suicidal until we know 
who the suicidal person is. As an ex- 
ample of the complexity, we need only 
point to such varied and desperate in- 
dividuals as the young teen-ager who 
blows off his head on his jilting girl- 
friend’s porch; the older man, wid- 
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omnipotent, and that we cannot rea- 
sonably expect universal and eternally 
perfect correlations with aspects of na- 
ture we have only incompletely con- 
ceptualized. 

Our questions at this point were old 
familiar ones. For example, we even 
debated the basic — of whether 
it was possible to identify or to predict 
an overt act or the state of mind which 
produces such behavior, from fantasy 
productions aroused in a special set of 
conditions, conditions which were usu- 
ally very carefully arranged before- 
hand to eliminate any behavior other 
than the production of fantasy. I will 
not burden us with reports of the ex- 
periments on the relationships be- 
tween test performance and behavior 
other than to remind us that they have 
been both confusing and frustrating. 
Dr. Bert Forer, Editor of the Journal 
of Projective Techniques, expressed es- 
sentially our feeling when, in com- 
menting on the papers in last year’s 
Symposium on Research in Projective 


owed and with a terminal disease, who?| Techniques, he said, “There is a good 
gently slashes his wrists and lied down’? deal of psychological substance be- 


in bed slowly to bleed to death; or the 
schizophrenic girl who attends to an 
hallucinatory voice that the first step 
down from the sixth story window of 
the hospital is not so high after all. 
For still more examples, we may look 
among the attempted suicides, where 
we may have the aborted successful at- 
tempt in which the individual is saved 
only by a freakish accident, or the gen- 
uine attempts with various degrees of 
a “gamble with death” (11), or the 
gestural attempt, where the intent to 
die is completely absent. It becomes 
obvious that the motives or the pat- 
terns of motives may run the gamut of 
those previously conceptualized, such 
as those offered by Durkheim, Freud, 
Jung, Menninger, Kelly, and others, 
and includes many, we are sure, yet to 
be conceptualized, we hope, by us and 
by others working with this problem. 
In our search for further constructs in 
the understanding of suicide, our dis- 
appointment served to remind us 
again that neither we nor our tests are 


tween drives and behavior.” Neverthe- 
less, while we are aware that the ex- 
pression of a motive in one context or 
at one level of psychological organiza- 
tion has not been too highly correlated 
with its manifestations at another psy- 
chological level, this basic assumption 
must be made or indeed there can be 
no justification for projective tests. In 
other words, there must be an essential 
relatedness or equivalence of behav- 
iors; the task then becomes one of find- 
ing behaviors with the highest equiv- 
alence. This becomes simply another 
way of stating the basic problem of val- 
idity: will the test data yield meaning- 
ful information about the behavior be- 
ing examined. 


It becomes apparent from the above 
that in the use of projective tests the 
nature of the stimuli employed in the 
investigation of any set of phenomena 
assumes proportions of major impor- 
tance. This attention to the nature of 
the stimuli would fall within that spe- 
cial aspect of validity suggested by Dr. 
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Shneidman in his five-part delineation 
of the problem of validity, specifically, 
Validity For What. 

In reflecting on the nature of the 
stimuli we had employed in our study, 
the closely related questions of glo- 
bality and of relevance seemed impor- 
tant. The evaluations of an entire 
TAT protocol in terms of composite 
judgments for the identification or pre- 
diction of particular kinds of behavior 
have generally led to poor results, The 
TAT, like most projective tests, was 
conceived of as a global test, yielding 
many data about many facets of per- 
sonality, with, as we have painfully 
learned, much depending upon the 
skill of the clinician-interpreter. ‘To- 
day, much of the trend seems to be to- 
ward devising instruments for measur- 
ing the individual constructs which go 
toward making up the global person- 
ality, a trend to which we wish to add 
our voice. Cronbach and Meehl in 
their article on Construct Validity in 
the Psychological Bulletin in 1955 (2) 
stated that global interpretations re- 
quire validation of the separate con- 
structs implied or inferred in each sep- 
arate statement about an individual. 
Bindra in his article entitled ‘“Psycho- 
therapy and Recovery from Neuroses” 
(1) suggested that globality may be all 
right for the clinician, but that the re- 
searcher must concentrate on “Truth” 
in small portions, chewable bites, and 
digestible bits. Kelly, in his review 
chapter on assessment in the 1955 An- 
nual Review of Psychology (6), lists 
tests for individual constructs, such as 
anxiety, dependency, and other famil- 
iar terms. In addition, the relevance of 
a test, such as the TAT, or Rorschach, 
for measuring a phenomenon also rests 
squarely in the middle of validity, the 
Validity of What question. The TAT, 
by the very nature of its construction, 
taps a wide range of many facets of 
personality and could not hope to en- 
compass an adequate appraisal of the 
polarities of each of the psychological 
constructs invoked. Other tests meas- 
uring such polarities within the indi- 
vidual constructs are needed, not to 


supplant the TAT, but rather to sup- 
plement it. 

At the beginning of the present 
study it seemed to us that the tech- 
nique of evaluating the attributes of 
the hero offered an unusual opportu- 
nity to stick close to the given data and 
thus to obtain interjudge reliabilities 
based on phenotypes. What we had not 
anticipated was that one cannot collate 
or average or add heroes, Consider: the 
clinician’s usual diagnostic task is com- 
parable to seeing a play, for example 
Hamlet, and then divining Shakes- 
pe personality attributes. In the 

ero-oriented approach, the task is to 
see Hamlet and then describe, in some 
systematic way, Hamlet’s personality 
characteristics. But, if on a single day 
the clinician were to see Hamlet, 
Love’s Labor Lost, Othello, Measure 
for Measure, King Lear, and All’s Well 
That Ends Well—how could he then 
give a single description of the chief 
protagonist of all these plays? Unfor- 
tunately, this was the judge’s task in 
the present study. 

The recommendation at this point is 
an obvious one (and one which we in- 
tend to pursue) ; namely that the pro- 
tocols be separated into their respec- 
tive stories and judgments be made 
across the protocols in terms of single 
stories, that is, all the heroes described 
in the stories given to Card 6BM would 
be judged, the heroes of 7BM, 13MF, 
etc., and then the correlations could be 
made. Further, we would intend to de- 
sign specific sets of picture thematic 
stimulus materials, to investigate hos- 
tility feelings, suicidal feelings, etc. 

Our suggestion, then, is for a modifi- 
cation of the stimulus data, using the 
TAT principle. We all know that the 
TAT has been manipulated in several 
ways. The first way that occurs to one 
is that Murray himself devised the 
TAT materials so that there was con- 
tained within them a continuum be- 
tween highly structured and highly 
ambiguous stimuli. The second way in 
which the TAT principle has been 
manipulated is by variation of the con- 
tent of the test materials for particular 
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purposes, i.e. the creation of special 
sets of TAT like cards for specific kinds 
of subjects. There are, for example, 
TAT modifications for Negroes, for 
Navajoes, for Utes, for paraplegics, for 
Michigan adolescents, Korean prison- 
ers-of-war, and even for some animals 
like the CAT. What we want to sug- 
gest is a third dimension along which 
the TAT principle can be varied; that 


is, for the measurement of specific beg 


havioral and emotional constructs. We 
believe that what seems to be called for 
are : ye fic sets of projective materials 
which are devised especially and used 
experimentally to explore the particu- 
lar continua of personality functions. 
Specifically, we suggest, that special 
sets of picture thematic materials be 
constructed along such continua as 
hostility, homosexuality, passivity, sui- 
cidal potential, etc. This suggestion is 
in line with previous work by Little 
and Winder (8) , and by Lesser (7) . In 
concentrating on the area of interest to 
us, we envisage the construction of a 
set of cards which would fall along the 
continuum of suicidal-fantasy evoking 
stimuli, in which is contained one card 
(among several) that has for practical- 
ly all subjects no suicidal card-pull, 
and at the other end another card that 
has for practically all subjects a pre- 
potent suicidal card-pull. As Murstein 
has similarly stated in his article “Non- 
projective Determinants of Perception 
on the TAT” (9), it is important to 
know the non-projective determinants 
of perception on any stimulus so that 
the amount of projection may be eval- 
uated. Suicidal fantasy on one card 
might indicate considerable distortion 
of the stimulus properties, while on an- 
other card reflect only considerable re- 
spect for the objective stimulus. Our 
interest in administering such a series 
to a subject would be in his limen or 
threshold—at which card did he begin 
to talk about suicide—as well as in his 
total suicidal score. We are exploring 
the possibility of including within the 
scale some other measure which might 
yield an index of the possibility of act- 
ing on the suicidal tendencies indicat- 
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ed, or, to give it a psychoanalytic con- 
ceptualization, evaluate the adaptive 
aspects of ego functioning. The specific 
determinant might be anxiety (con- 
struct: add anxiety and one gets agita- 
tion, a significantly dangerous suicidal 
state) or guilt (construct: guilt cou- 
“ee with fear stimulates expiatory 

opes and activity, also a significantly 
dangerous suicidal state.) 


These suggestions are not meant as 
“practical” in the sense of our rushing 
to construct another test for the Psy- 
chological Corporation catalogue. Dr. 
Friedman, in his critique of our paper 
published in the September 1958 issue 
of the Journal of Projective Tech- 
niques (5), after indicating that our 
suggestion appears to be an approach 
which might reduce the thorny prob- 
lem of collating, averaging, or adding 
heroes as well as an attempt to put sub- 
jects on the same measuring scale with 
reference to particular personality var- 
iables, then goes on and lists some mis- 
givings about this notion, which es- 
sentially have to do with quantifica- 
tion of the data as opposed to the more 
global approach. Parenthetically, we 
might say that Dr. Friedman sent us a 
copy of his Critique before it was pub- 
lished, and we were asked if we wished 
to answer it. We were hardly in a po- 
sition to rebut his paper inasmuch as 
we agreed with (or had stated) every- 
thing which he said. However, we see 
no reason why we should not pursue 
this avenue of approach, if for no other 
reason than to see where it might lead. 
In this sense, we find the current ef- 
forts in the field of thematic testing 
vastly exciting in that they continue to 
suggest new procedural techniques as 
well as new conceptualizations of hu- 
man behavior which may be useful in 
the exploration of personality. 
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Problems in the Validation of Projective Techniques 


KENNETH B. LITTLE 
National Institute of Mental Health, Bethesda, Maryland 


We are discussing today the general 
issue of the validation of projective 
techniques. However, I will limit my 
remarks to the effective validity of 
such instruments rather than attempt 
to cover all forms of validation. “Ef- 
fective validity” may be subsumed 
under predictive validity but is re- 
stricted to the class of useful predic- 
tions rather than any and all predic- 
tions. 

In the clinical situation, where most 
projective techniques are adminis- 
tered, validity has a most pragmatic 
air. The construct validity of the in- 
strument may be of theoretical interest 
to the psychologist and the concurrent 
validity an issue of some curiosity, but 
as far as the case in hand is concerned, 
one usually wishes to make meaning- 
ful and useful predictive statements 
about a patient. When one administers 
a Rorschach or a TAT to such a per- 
son, the implicit or explicit questions 
are of a rather prosaic sort: “Will this 
patient attempt suicide?”, “Does he 
have some form of organic impair- 
ment?”, “Would psychotherapy be an 
appropriate form of treatment?”, and 
the like. The questions are those the 
psychologist, as a psycho-diagnostician, 
was hired to answer. 

Other information may be, and usu- 
ally is, provided in the test report. And 
certainly there is little expectation that 
the answers to the questions will reach 
such a degree of specificity as, “This 
patient will jump out of the ward bath- 
room window at 5:54 a.m. on Thurs- 
day.” The report should include, how- 
ever, statements about the subject that 
raise the probability of making correct 
assumptions as to his future behavior. 
The predictions involved may be quite 
indirect and dependent upon fairly 
complex chains of inferences but if 
personality assessments are to be other 
than exercises in composition, some in- 


formation with implications for be- 
havior must be included, It is the ac- 
curacy of the significant predictions, 
the correctness of the answers to the 
perennial questions, that may be called 
effective validity. 

Aiter this preamble, it is somewhat 
embarrassing to have to say that the 
published evidence on projective tech- 
niques indicates that they have either 
zero or, at best, very low positive effec- 
tive validity indices. Even in those 
studies with the most positive of re- 
sults, correlations are of an order of 
magnitude which make predictions for 
the individual largely a waste of time. 


This distressing state of affairs has 
been with us unchanged for a rather 
long period of time. During the 35 or 
so years in which, for example, the 
Rorschach has been subject to syste- 
matic investigation, the only notice- 
able change has been in the sophisti- 
cation of design and analysis. Negative 
or ambiguous results, however, are just 
as disheartening when determined by 
an F ratio as when determined by a 
critical ratio. 

The situation as described has been 
pointed out by several people, e.g., 
Cronbach, (4) so I will not spend time 
in citing evidence. Instead I would like 
to consider the questions of “Why?” 
and “What do we do now?”. A defini- 
tive answer to either of these would 
still require far more time than is 
available in this paper so I will per- 
force be quite arbitrary about what I 
present. 

An answer to the question of why 
our projective techniques fail so mis- 
erably at practical predictions is obvi- 
ously required before any rational so- 
lutions can be proposed. However, the 
determinants involved in the failure 
are numerous and many and quite 
difficult to eliminate. The fact that be- 
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havior is a function of both the organ- 
ism and the situation means that any 
prediction based upon evaluation of 
the former alone will always be a con- 
tingent one. Since the number of pos- 
sible alternative stimulus situations is 
astronomical the problem of raising 
the validity of our instruments seems 
an impossible one. Fortunately hu- 
mans as wel] as rats are capable of, and 
facile at, generalization, so it is possible 
to reduce the number of stimulus situ- 
ation categories sharply. We are able 
to say that one man’s candy is another 
man’s brandy and call both consum- 
matory responses, or oral behavior, etc. 
This inevitably leads to a loss in pre- 
cision in the prediction of specific be- 
havior so we find ourselves anticipat- 
ing alcoholism when obesity in fact 
occurs. However, correlations of less 
than unity are a fact of life with which 
psychologists have already learned to 
live. 


However, there are other issues hav- 
ing to do more directly with the assess- 
ment process and with the instruments 
used. In our limited time, I wish to talk 
about two of these, one quite specu- 
lative and the other rather concrete. 
The speculative issue has to do with 
the interpretation of the basic postu- 
late of projective testing, that of psy- 
chic determinism. The term “inter- 
pretation” is used deliberately since 
few people now argue that personality 
factors have no effect whatsoever on 
responses to projective technique stim- 
uli. The rub occurs when we assume 
that all or a majority of the responses 
are determined by dominant and en- 
during characteristics, equally oper- 
ative in the every day life of the indi- 
vidual as in the testing situation, The 
experienced tester allows for set, de- 
fensive factors unique to the particu- 
lar situation, and the like in his inter- 
pretations. Then at this point he often 
assumes that what remains represents 
the regnant motives, ego-protective 
mechanisms, behavior style and so 
forth that characterize the individual 
in all situations. I wonder if this is 
necessarily the case. Perhaps all that is 
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revealed is a micro-personality of 
limited extent. 

This amounts to a questioning of 
the unity of the personality, its Gestalt 
quality. It does not deny psychic de- 
terminism but merely raises the query 
as to whether that aspect of the 
psyche involved in responding to semi- 
structured ambiguous stimuli is of any 
practical significance in the rest of 
man’s behavior. During great stress 
when the individual is fighting for 
physiological or psychological survival, 
all forces are mobilized toward a com- 
mon end and common response pat- 
terns may be apparent in all or most all 
situations. Under less extreme condi- 
tions perhaps the individual is far 
more differentiated than we expect, 
with a wide selection of response roles. 
Even in physiologic functions such an 
effect may be observed; systolic and 
diastolic blood pressure, for example, 
correlate .60 in subjects suffering from 
combat fatigue but the relationship 
drops to .06 after recovery (10) . A sim- 
ilar phenomenon may occur among 
psychic functions. 

The above, as stated earlier, is obvi- 
ously speculative; there are, however, 
specific aspects of our assessment in- 
struments towards which we can point 
with some assurance in the attempt to 
determine why they fail. These have 
been examined at some length by An- 
astasi (1), Cronbach (4), MacFarlane 
and Tuddenham (7), and others, so I 
will discuss only the sampling prob- 
lem. 

In the Rorschach one has ten stimuli 
and from ten to sixty or so responses; 
in the TAT twenty pictures eliciting 
hopefully twenty stories; in other pro- 
jective —— a varying number of 
stimuli but of a similar order of mag- 
nitude. With these stimuli we secure a 
small sample of the behavior of the in- 
dividual and assume it is representa- 
tive of his total population of imag: 
inative responses. Then to make life 
even more difficult we attempt to 
“measure” five to seventy-five different 
variables with this one sample. The 
wise tester soon learns to restrict his 
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assessment endeavor but typically we 
find that we are still assigning values 
to the variables we do use on the basis 
of one or two responses. Since these 
few responses are simply a sample of 
a number of potential responses to the 
stimuli, the reliability of the measure- 
ment is quite questionable. The pro- 
jective technique interpreter is in a po- 
sition analgous to that he would be in 
if one took a random sample of 50 
items from the MMPI, permitted the 
subject the option of answering them 
in any fashion he chose (including not 
answering them at all) and then asked 
the psychologist to describe the MMPI 
profile. Depending upon the sample of 
items, the subject’s choice of answer 
categories, and the tester’s knowledge 
of the MMPI scales, there might be an 
occasional fairly correct reproduction 
of the whole profile, a slightly more 
frequent fair approximation of one or 
two scale scores and, most probably, 
an overall accuracy index that resem- 
bled strongly the validity co-efficients 
for projective techniques. 

Since others on the symposium have 
discussed this point also, I would like 
to turn to my second question, i.e., 
what do we do now. This discussion 
I will limit to some suggestions regard- 
ing increasing the precision of meas- 
urement of projective techniques. 

There are several different ways in 
which this might be done—all relative- 
ly painful. The' first, which continues 
the idiographic global approach, 
would be simply to increase the length 
of the current instruments — a Ror- 
schach of 50 cards, a TAT of 100, for 
example. It is theoretically pars in 
this fashion to secure enough replicat- 
ed information about a subject to de- 
termine the interrelations among per- 
sonality characteristics and to enable 
us to specify the links between person- 
ality structure and behavior. 

I do not have much hope for such 
a procedure; increasing the size of our 
global instruments will increase pro- 
portionately the amount of irrelevant 
information to be handled, placing a 
still greater load on the intellectual 
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and experience level of the clinician. 
Unless we are to increase training time 
indefinitely it is almost mandatory that 
our instruments do a certain amount 
of pre-digesting for us, sacrificing cov- 
erage for precision. As Tomkins and 
Miner describe it (9) we must be pre- 
pared to throw away information in 
order to raise the probability of cor- 
rectness of that we do obtain. 

If we adopt such an austerity pro- 
gram, there appear to be two modes 
of approach which might be used. The 
first way might be considered analo- 
gous to the item analytic techniques 
used in objective test construction. 
The literature on the existing tech- 
niques can be searched for stable and 
hopefully useful determinants of re- 
sponses to projective stimuli and spe- 
cial scales constructed for their meas- 
urement. After construction of the 
scale, there will be a long and tedious 
period of establishing the empirical 
correlates of the scale. A good example 
of this method is Barron’s (2) M scale 
for measuring the human movement 
response to ink blots. If such single de- 
terminant scales are too molecular, 
then additional scales can be construct- 
ed for configural combinations; the es- 
sential point is that there be sufficient 
stimuli for reliable measurement. 

The second approach is the construc- 
tion of projective instruments designed 
to measure specific personality vari- 
ables within certain theoretical frame- 
works. The Blacky Test (3) incorpo- 
rates this idea to some extent but still 
attempts to measure too many vari- 
ables with too few items. For precision 
of measurement, a minimum of about 
five reasonably independent projective 
“items” seems essential. Some of the 
sentence completion tests (Forer (5) 
and Rotter and Rafferty (8) ) might be 
considered illustrations of this ap- 
proach. McClelland’s use of the TAT 
pictures for assessing Need Achieve- 
ment also is a method of measurement 
that can be considered of this type. 

A final example of a thematic pro- 
jective technique devised for measure- 
ment of a specific personality charac- 
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teristic is one developed by C. L. Win- 
der and myself, It is similar in its gen- 
eral construction to an instrument de- 
veloped quite independently by Lesser 
(6). This test which has no formal 
title is designed to tap two aspects of 
the fantasy life of boys—fantasy ag- 
gression and fantasy aggression anxi- 
ety. In constructing it we started with 
100 pictures drawn by an artist to rep- 
resent a variety of boy-boy, boy-girl, 
and boy-adult interactions and with 
varying degrees of pull for aggressive 
response. The subjects wrote stories to 
the pictures under general TAT type 
instructions. Reliability of scoring is 
better than .92 and split half reliabili- 
ties (alpha coefficients) range between 
.71 and .88 depending — the type 
of aggression fantasy, e.g. boy-boy, boy- 
adult, being assessed, Since this is but 
the first round of refinement we hope 
to increase those reliability figures 
somewhat. 


Such instruments as this last demon- 
strate very well the advantages and 
disadvantages of the projective inven- 
tory approach. A tremendous amount 
of information is sacrificed in the in- 
terest of precision, without, it should 
be added, with any guarantee that the 
resulting scores will be any more use- 
ful for predicting behavior than the 
global assessment. But since the global 
methods do not appear to work, we 
must, of necessity, try something else. 
Such inventories, moreover, do not per- 
mit the integration of multiple factors 
into a unique personality description, 
at least when administered singly. 
However, since it is my belief that 
unique descriptions that do not gen- 
erate accurate predictions are merely 
works of art and not clinically mean- 
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taking. 
ingful I think the gamble is worth 
There are other experimental vari- 
ations of projective techniques such as 
the PAT (9) which time does not per- 
mit me to discuss. They represent, I 
think, a very healthy trend toward ex- 
ploration of all possible methods for 
personality assessment. If projective 
techniques are to be other than ex- 
amples of the ingenuity of psycholo- 
gists and their interpretation other 
than a measure of the clinicians verbal 
ability, then some modification of our 
present instruments is essential. 
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Further Check on the Diagnostic Validity of the Ames Danger Signals 


LouIsE BAaTEs AMES 
(With the assistance of MARJEAN KREMER) 
Gesell Institute of Child Development 


An earlier study of child Rorschach 
responses (1) has demonstrated that 
many kinds of response customarily 
considered as indicative of disturbance 
or even of pathology in the adult, oc- 
cur not only benignly but even nor- 
matively in the child subject. Particu- 
larly at such ages as 314 and 7 years, 
the apparently typical Rorschach re- 
sponse includes several conspicuous 
features which would suggest disturb- 
ance in the adult. 

Nevertheless, in spite of the appar- 


_ent normality of many traditional 


danger signs when such responses ap- 
pear in the child’s record, we know 
that the clinician who works with chil- 
dren does depend on the Rorschach for 
clues as to the nature and degree of 
disturbance in presumably disturbed 
child subjects. Thus we face the prob- 
lem — which kinds of responses cus- 
tomarily considered to be danger sig- 
nals actually do characterize the re- 
sponse of children judged on other 
measures to be “disturbed” children. 
We ourselves tentatively identified a 
list of thirteen signs which seemed to 
us potentially indicative that a sub- 
ject was in real difficulty. The first at- 
tempt at validating this list of danger 
signals is that of Elkins (3). Of the 
thirteen “Ames danger signals” sug- 
gested by us, Elkins me ae four as 
involving subjective judgments, and 
then increased the number of signs to 
be tested to a total of thirteen because 
of the necessity of treating the parts of 
four complex signs as separate signs. 
Her list, then, included the following: 
1, Preponderance of tree or leaf responses 
(Preponderance defined as over 50% 
of R). 
2. F0% exceeds 75. 
3. More than one pure color response. 
5A. More than one response involving differ- 
entiated shading. 


5B. More than one response involving dif- 
fuse, dysphoric shading, i.e. Clob. 
6. More than one inanimate movement re- 
sponse. 
8. W% less than 40. 
9A. Dd& over 15. Dd% equals Dd, S, and Do 
responses over total R. 
9B. More than one X response. 
9C. More than one S response. 
10A. AY over 60. 
10B. No human response (i.e. no H; Hd not 
counted) . 
11. R less than 10. 

Elkins selected a group of forty pre- 
sumably normal 8, 9 and 10 year olds 
matched for sex, age and I.Q. with a 
group of forty children who had been 
referred to the Institute for Juvenile 
Research in Chicago as being in need 
of child guidance services. The Ror- 
schach was given to each of these sub- 
jects. 

Following careful‘procedures, Elkins 
checked these records for presence or 
absence of the suggested “danger 
signs”, and then compared the total 
scores for the clinic and nondisturbed 
groups, by means of a variation of Mc- 
Nemar’s Formula 26 for the evaluation 
of the difference between two corre- 
lated means. The resulting figures 
showed that “the clinic, or presumably 
disturbed, group had significantly few- 
er danger or maladjustment signs than 
did the nondisturbed group.” 

A more detailed analysis by individ- 
ual signs indicated that “most of the 
Ames danger signals are not useful for 
differentiating between clinic and non- 
disturbed samples of 8 to 10-year-olds 
because nearly half of the signs are 
either characteristically present in a 
sizeable proportion of both normal 
and clinic cases, or else are almost 
never found in the records of either... 
The most promising signs which dis- 
criminate in the expected direction are 
pure C > I, more than one diffuse, 
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dysphoric shading response, and m > 
1. These signs, like contamination for 
schizophrenia, are apparently diag- 
nostic when they occur, but have 
such limited incidence as to be of 
little use for routine screening pur- 
poses.” 

Such extremely negative findings 
suggest to us two possibilities. The first 
is that even to a greater extent than we 
had suspected, the kinds of Rorschach 
response often considered, especially in 
the adult, to be indicative of disturb- 
ance, have a benign significance when 
they occur in the records of children. 
(Our earlier study had, in our opinion, 
ruled out as danger signs many kinds 
of response often considered to be dan- 
gerous, but we had nevertheless offered 
tentatively as possible danger signals 
the thirteen signs tested by Elkins.) 

An alternative possibility, and one 
which we favored, was that Elkins’ 
clinic population of forty subjects, 
whose records did not show a signifi- 
cant number of our suggested danger 
signs, were perhaps not seriously dis- 
turbed children. Elkins herself feels 
convinced that “the clinic population 
tested represented a really disturbed 
group”. In fact she is more convinced 
of this than that her “nondisturbed 
group was an ideally normal popula- 
tion.” 

However, it has been our finding in 
clinical practice that a large majority 
of children brought to us for clinical 
help, are not what we consider serious- 
ly disturbed children. 

The present study duplicates Elkins’ 
investigation, with the possible excep- 
tion that we took great care that our 
disturbed population was made up 
only of subjects whom we had previ- 
ously classified as clearly disturbed, 
and that we used boys only. 


SUBJECTS AND METHODS 


Records of all clinical cases between 
the ages of 6 and 12 years, seen at the 
Gesell Institute of Child Development 
during the five years from June, 1953 
to June, 1958 were reviewed by means 
of reading over case-conference notes. 
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All cases where the case-conference 
exclusive of the Rorschach, which was 
not considered, indicated “emotional 
instability”, “emotional disturbance”, 
“need for therapy”, were then checked 
with the summary of these cases writ- 
ten by Dr. Frances Ilg, Director of the 
Clinical Service. Only those cases were 
selected which were labelled by Dr. Ilg 
as Clearly disturbed. 

Since in this total group of cases, 
records of only six girls were included, 
it was decided to use boy subjects only. 
In order to make up a total of fifty 
cases, records from the six month 
period from January to June, 1953 
were included. This final total of fifty 
subjects whom Dr. Ilg considered to 
be clearly unstable or disturbed, were 
thus chosen from a total of approxi- 
mately three hundred records of boys 
of ages between 6 and 12 years. 

Only children with I.Q.’s rated as 
average or better were included. 

A control group of fifty presumably 
normal subjects was chosen from 
among our normal research popula- 
tion as follows. All disturbed cases were 
checked for age, I.Q., and father’s occu- 
pation. Then a research assistant 
(MJK) selected from among our nor- 
mal population cases which matched 
the disturbed cases for age, I.Q. and 
father’s occupation. 

SES and intelligence ratings for the 
two groups are given in Table I. Simi- 
larity of the two groups is marked. The 
largest numbers of subjects in both 
groups are from the professional or 
semi-professional classes. Only 7 of the 
disturbed subjects and only 8 of the 
normal group fall below Class III, that 
of clerical, retail business or semi- 
skilled occupations. As to intelligence, 
largest sanders in both groups fall in 
the average category, next most in high 
average. Only 8 of the disturbed sub- 
jects, only 6 of the normals are unclassi- 
fied — thus possibly below average. 

During the selection of these two 
groups, the normal and the disturbed, 
the Rorschach records were not con- 
sulted nor considered. That is, neither 
the “disturbed” nor the “normal” sub- 
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TABLE I—Socio-economic Status and 
Intelligence of Subjects 


No. normal No. disturbed 
Class subjects subjects 
Socio-economic status 
I 16 16 
II 15 16 
III ll ll 
IV _ _ 
Vv 5 6 
VI 1 _ 
VII — — 
? 2 1 
Intelligence ratings 


Very superior 1 2 
Superior 9 7 
High Average 15 9 
Average 19 24 
Unclassified 6 8 


jects were selected on the basis of their 

Rorschach performance. Once selected, 

however, both groups of cases were 

then checked for the occurrence of 
possible “danger signals” in the Ror- 
schach records. Our original 1952 list 
which, in a slightly modified version 

(3, p. 283) was tested by Elkins, is 

as follows: 

1. A preponderance of leaf or tree responses, 

particularly if given perseveratively, after 
4 or 5 years of age. Also static persevera- 
tion of any simple one-word response. 

. After 5 years of age, more than 75%F. 

. More than one pure C is suspect. 

. Color naming is suspect after 5 years. 

More than one shading or clob in a nor- 

mal length record. 

6. More than one m. 

7. Contamination after 514 and confabula- 
tion after 7. 

8. Less than 40 to 50% W. 

9. More than 10 to 15% Dd. Especially more 
than one Do or more than one S. 

10. Over 60%A. Less than one H. 

ll. After 31% years, a total of less than 10R. 

12. Position responses. 

13. Other behaviors which are at least suspect 
are the giving of extremely bizarre re- 
sponses, inability to cope at all with the 
test, excessive manipulation, or particu- 
Jarly odd manipulation. 


Subsequent experience with the 
Rorschach test in children has led us to 
make the following modifications of 
this list: 

1. “Preponderance of leaf or tree responses” 
was modified to read, “Three or more leaf 
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or tree responses, or 15% or more of total 

content.” 

2. 75% F or more was changed to the proba- 

bly more significant F% >F+%. 

. “One or more C” was changed to “Any C 

or color naming.” 

4. “More than one shading or Clob in a 
normal length record” was modified to 
“Any variation of Clob or C’.” 

5. ‘‘More than one m” was changed to “Three 
or more m orm > M + FM. 

6. Area was omitted as a criterion except for 
“25% or more Dd or more than one Do.” 

7. “Over 60%A” was raised to “Over 70%A.” 

8. ‘‘Less than one H” was changed to “No H; 
or Hd: H is 2:1.” 

The following items were added: 

1. “Elimination, sex or reproduction re- 
sponses.” 

2. “Three or more anatomy, or 15% or more 
of the total content; or F— anatomy.” 

3. ‘‘Personal reference or more than one re- 
sponse suggesting trouble, as old, broken, 
decayed, damaged objects.” 

With these, modifications, we arrive 
at the list of possible “danger signs” 
listed in Table II. This list, because of 
these changes, is slightly different in 
detail from the list which Elkins at- 
tempted to validate. However, it is 
similar; and since our aim is to dis- 
cover what kinds of Rorschach re- 
sponses are danger signals, and not 
merely to “defend” our original list, 
these variations in the original list 
seem permissible. 

Table II lists all proposed danger 
signals and indicates the number of in- 
stances, out of 50 normal and 50 dis- 
turbed subjects, paired, in which both 
of the pair, neither of the pair, only 
the disturbed subject or only the non- 
disturbed subject showed the sign. It 
also indicates what percent of each 
group show each sign, and includes a 
statistical measure of significance. 

Mean number of these proposed 
signs in our normal group is 1.6; in our 
disturbed group, 5.0. There is a statis- 
cally significant difference (t=15.90) 
between these figures. However, as 
Table II shows, some of the individual 
measures within this list are more ef- 
fective than are others in distinguish- 
ing between normal and disturbed 
subjects. 


Ow 








TABLE I]—Incidence of “Danger Si 
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als” in Matched Pairs of 


Clinic and Nondisturbed Subjects 


Sign occurs in 
Both of Neither Only non- Only 


Percentage of occurrance 


Chi- on-dis- 

Danger signal pair ofpair disturbed clinic square* turbed Clinic 
la. Less than 10R 5 31 4 10 1.79 18% 30% 
lb. Excessive 

manipulation 0 38 0 12 10.08** - 24%, 
2a. More than 25% Dd 1 41 3 5 12 8% 12% 
b. More than 1 Do 1 39 2 8 2.50 6% 18% 
3. F% exceeds F 4+ % 2 29 2 17 10.31** 8% 38% 
4. 3 or more m; or 
m=or >M+ FM 0 36 4 10 1.79 8% 20% 
5. lormoreC orCn 4 19 8 19 3.70 24%, 46%, 
6. 70%A or more 0 36 8 1] 3.50 6% 22% 
7. NoH; or Hd:H = 
2:1 or more 4 27 3 16 7358°° 14%, 40%, 
8. Static perseveration 0 35 0 15 13.06* - 30%, 
9. Contamination 0 33 1 16 Hia* 2% 32%, 

10. Confabulation 2 28 3 17 8.45°° 10% 38%, 

11. Position response 2 42 0 6 = a 4% 16% 

12. Leaf or tree — 

3 or 15% of content 0 43 2 5 53 4 1 
13. Elimination or sex 1 43 0 6 4.17% oe ae 
14. Anatomy — 3 or more 
or 15%; or F— 
anatomy 0 37 6 i 00 12% 14% 
15a. Bizarre content 3 26 1 20 15.43** 8% 46%, 
b. Personal reference 0 41 1 8 5.33" 2% 16% 
c. Troubled content 4 18 4 24 1250°* 16% 56% 
16. Dysphoric shading: 
variations of Clob 
or C’ 0 39 3 s 1.45 6% 16% 


* Corrected for continuity (McNemar’s Formula 86a). 


» Less than 10 subjects. 
* Significant at .05 level. 
** Significant at .01 level. 


As this table indicates, 7 of our pro- 
posed measures do show a significant 
difference between normal and dis- 
turbed groups either at the .05 or the 
.01 level. Two others show significance 
at the .05 level though there are fewer 
than ten subjects involved, which 
makes the measure of significance of 
questionable value. Two further signs 
— | or more C or Cn, and 70%A or 
more — fall just below the .05 level of 
significance. 

This gives a total of 11 of the ond 
posed 16 measures which either show 
statistically significant differences be- 
tween the two groups, or differences 
which are close to statistical signifi- 
cance. This list includes the following 
signs which do appear to a statistically 
significant extent to distinguish dis- 
turbed from non-disturbed subjects: 


1. Excessive manipulation. 
2. F% exceeds F+%. 
+3. One or more C or Cn. 
+4. 70%A or more. 
5. No H, or Hd: H is 2:1 or more. 
6. Static perseveration. 
7. Contamination. 
8. Confabulation. 
*9. Position response. 
*10. Elimination or sex response. 
11. Bizarre content; personal reference; 
troubled content. 
+ Falls just below the level of statistical signi- 
ficance at the .05 level. 
* Fewer than 10 instances. 


Of the proposed danger signs, that is 
of the kinds of response which we have 
been inclined to consider as possible 
trouble indicators, the following did 
not occur significantly more in our dis- 
turbed than in our normal population: 
1. More than 25%Dd or more than one Do. 
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2. Three or more m. 

3. Three or more leaf or tree responses. 

4. Three or more anatomy responses or F— 
anatomy. 

5. Dysphoric shading. 


NUMBER OF DANGER SIGNS PRESENT 


Perhaps equally important or even 
more important than the actual nature 
of disturbance signs in any given rec- 
ord, in detecting emotional disturb- 
ance on the part of the subject, is the 
number of such signs present in any 
one record. 

Table III which presents the fre- 
quency of occurrence of proposed 
danger signs, and Table IV which 
gives the frequency of those which 
turned out to be significant, and also 
the cumulative percentages of distri- 
bution, show clearly the quantitative 
differences in number of signs present 
in normal and in disturbed records. 
Only two out of 50 normal subjects 
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have more than three of the initially 

roposed danger signs, and more than 
fal have either one or none. Con- 
versely, only seven of the disturbed 
subjects have as few as three of the 
proposed signs. By percentages, only 
4%, of the normal group have four 
or more signs, whereas 86% of the 
disturbed subjects have four or more. 
Mean number of these signs in the 
normal group is 1.6; in the disturbed 
group, 5.0. The difference is statisti- 
cally significant. 

Considering merely those signs 
which turned out to occur with a statis- 
tically significant frequency more often 
in the disturbed than in the normal 
group, equally conspicuous differences 
exist. Only five normal subjects have 
more than one sign, whereas only two 
of the disturbed subjects have less than 
two. By percentages, only 10% of the 
normal group have two or more signs, 
whereas 90% of the disturbed have two 


TABLE mathe wee of Occurrence of Proposed Danger Signs 


and 
Number of subjects giving each 
number of signs 
No. of Signs* Normal Disturbed 
0 6 0 
1 22 0 
2 12 0 
3 8 7 
4 1 13 
5 0 13 
6 0 7 
7 }» 7 
8 0 3 
Mean 1.6 5.0 (t=15.90) 


umulative Percentage Distribution 


Cumulative percentages 


No. Normal Disturbed 
1 or more 88 100 
2 or more 44 100 
3 or more 20 100 
4 or more 4 86 
5 or more 2 60 
6 or more 2 34 
7 or more 2 20 
8 or more 0 6 


* This counts one score to a numbered sign even though a and b may both be present. 
* A 7-year-old boy now seems probably disturbed, but who was included in our initial study as 
coming from a regular school group and judged normal by the teacher. 


Tasie IV—Frequency of Occurrence of Significant Danger Signs 
and Cumulative Percentage Distribution 


Number of subjects giving each 
number of signs 


No. of Signs Normal Disturbed 
0 26 0 
1 19 2 
2 4 14 
3 0 19 
4 1 8 
5 0 6 
6 0 0 
7 0 1 
Mean 6 3.1 (t = 12.88) 


Cumulative percentages 


No. Normal Disturbed 
1 or more 48 100 
2 or more 10 96 
3 or more 2 68 
4 or more 2 30 
5 or more 0 14 
6 or more 0 2 
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or more. Mean number of these signs 
in the normal group is .6; in the dis- 
turbed group, 3.1. The difference is 
statistically significant. 


AGE DIFFERENCES IN NUMBER OF 
DANGER SIGNALS 


Our earlier study (1), which sug- 
gests that quite different kinds of re- 
sponses characterize different age levels 
so far as the Rorschach response is 
concerned, leads us to the question — 
do danger signals occur more, either 
for normal or for disturbed subjects, 
at some ages than at others? Tables V 
and VI present data on this subject. 
Mean numbers of danger signals at 
each age for both groups are given, 
both for all proposed danger signals, 
and then for only those proposed dan- 
ger signals which turned out to be 
statistically significant. 

When the entire list of proposed 
danger signals is considered, it appears 
that in our normal group, the largest 
number of these signs occur at 6 and at 
7 years of age. For the disturbed group, 
high points for the occurrence of such 
signs are 6 years and 9 years, though at 
all ages except 12 years, four or more 
such signs occur and there is little dif- 
ference in number from one age to an- 
other. 

Counting only the significant signs, 
7 years appears to be the high point of 
occurrence in our normal group, 6 


TABLE V—Mean Number of Proposed 
Danger Signs per Child — Normal and 
Disturbed Subjects, at Increasing 


Age Levels 
Number of signs 
Normal Disturbed 
group group t Test 
Mean, total group 1.6 5.0 15.90° 
Median, total group 1.0 5.0 
No. of 
Age subjects 
6 yrs. 6 2.1 6.1 
7 yrs. 7 2.8 4.8 
8 yrs. 7 1.6 4.8 
9 yrs. 7 13 55 
10 yrs. 12 1.1 5.0 
11 yrs. 8 1.1 44 
12 yrs. i. 1.0 4.0 


* Significant at .01 level 


TaBLe VI — Mean Number of Signifi- 
cant Danger Signs per Child—Normal 
and Disturbed Subjects, at Increasing 


Age Levels 
Number of signs 
Normal Disturbed 
group group t Test 
Mean, total group 6 3.1 12.88* 
Median, total group .0 3.0 
No. of 
Age subjects 
6 yrs. 6 8 4.1 
7 yrs. 7 1.3 3.2 
8 yrs. 7 of 3.0 
9 yrs. 7 Pa 3.4 
10 yrs. 12 S$ 2.7 
11 yrs. 8 2 3.0 
12 yrs. 3 1.0 2.0 


. Significant at .01 level 


years in the disturbed group, with a 
subsequent steady decrease in number 
except at 9 years when there are a few 
more than at 8 years. 

Our earlier study suggests that of the 
16 danger signs proposed here, none 
are found normatively except at 7 years 
of age, at which time C responses, con- 
fabulation, and bizarre, morbid or 
troubled content may occur in pre- 
sumably normal subjects. 


DIscUSsION 


The “sign” approach to the diag- 
nosis of personality difficulty as re- 
vealed by the Rorschach, though gen- 
erally useful to clinicians, has not as 
yet been validated by experimental 
study. Even should research support 
such an approach, as the present study 
appears to do, most clinicians agree 
that it should never be used by those 
unskilled and inexperienced in clinical 
work with the Rorschach to take the 
place of thoughtful and painstaking 
evaluation of any individual protocol. 

An examiner’s total impression of a 
Rorschach response will in most cases 
reveal much more than any checking of 
a list of signs, no matter how adequate 
that list may be. Thus, emotional dis- 
turbance in a child subject is revealed 
by such things as going over and over 
the same blot area, by consistent dis- 
satisfaction with responses, by extreme 
unevenness of responses, by presence 
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of various kinds of shock responses (as 
described by Phillips and Smith (6). 
Or, as Halpern (4) points out, emo- 
tionally disturbed children have in- 
numerable ways of responding which 
emphasize their emotional disturb- 
ance. Some “emphasize control, con- 
formity, and compliance, feeling a 
need to repress aggression and accept 
the dictates of authority. Thus they 
show marked constriction and inhibi- 
tion.” Others may go to the opposite 
extreme. 

Or, with more seriously disturbed 
subjects, according to Halpern, “There 
are disturbances in identification, dis- 
organizing anxiety, defects and distor- 
tions in reality testing beyond what 
could be ascribed to immaturity, and 
an inability to establish satisfying ob- 
ject relations.” 

Thus it should be made clear that: 

1) The mere checking of a list of 
danger signs, no matter how adequate 
this list may be, should never be relied 
on to identify disturbance in the child 
subject, since it is the total impression 
of the record which most clinicians 
consider to give the most significant 
clues. 

2) The signs suggested by us in this 
paper, as possible danger signs, are by 
no means original with us. All have 
been considered at one time or another 
or by one investigator or another to be 
at least suspect when found in child 
records. 

The aim of the present paper is 
merely the comparison of records from 
a group of presumably normal subjects 
with records of a group of subjects 
comparable as to intelligence and 
socio-economic status, but considered 
to be emotionally disturbed, to deter- 
mine whether or not a series of sup- 
posed danger signs occurs to a statisti- 
cally significant degree more in the dis- 
turbed than in the normal group. 

Such a check has heretofore been 
made only, to our knowledge, by El- 
kins, who found that such signs do not 
occur more in a disturbed than in a 
normal group. The present study indi- 
cates that a list of such signs, if care- 
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fully selected, does distinguish the two 
groups — occurring more in the dis- 
turbed than in the normal subjects. 


SUMMARY 


Rorschach records of fifty boys be- 
tween the ages of 6 and 12 years, seen 
by the clinical service of the Gesell In- 
stitute and classified as emotionally dis- 
turbed were matched with records of 
fifty “normal” research cases, for age, 
sex, intelligence and socio-economic 
status. 

These pairs of records were com- 
pared for presence of 16 proposed 
“danger signals.” This list of danger 
signals is a slight modification of a 
list first proposed in an earlier study 
(1), and recently checked by Elkins 
(3) who found that her non-disturbed 
group actually’ expressed a_ higher 
number of these signs than did her 
disturbed group, all of whom were 
subjects referred to a clinic as being in 
need of child guidance services. 

Assuming that by no means all chil- 
dren referred to a child guidance clinic 
are actually “emotionally disturbed”, 
we selected from over 500 clinical re- 
ferrals seen by us in the last five years, 
fifty boys whom, judged on other re- 
sponses than the Rorschach, we had 
classified as clearly emotionally dis- 
turbed. 

When their Rorschach records were 
checked against records of fifty 
matched “normal” cases, it appeared 


that seven of sixteen_proposed_meas- 
ures Occur eeeapicaenaks more in_ the 











disturbed casés, showin liicant 
difference between normal and dis- 
urbed sub] eith the 


.01 levels. ‘Two other signs show signi- 
ficance at the .05 level though fewer 
than ten subjects were involved, mak- 
ing the measure of significance of ques- 
tionable value. Two further signs fall 
just below the .05 level of significance. 
The eleven signs which thus ap- 
peared to a conspicuously greater ex- 
tent in disturbed than in non-dis- 
turbed subjects are: 
1. Excessive manipulation. 
2. F% exceeds F+%. 








One or more C or Cn. 
70% A or more. 
No H, or Hd: H is 2:1 or more. 
Static perseveration. 
Contamination. 
Confabulation. 
Position response. 
. Elimination or sex response. 
. Bizarre content; personal 
troubled content. 
As to the number of danger signals 
present per case, if we consider all six- 
teen signs as proposed, only two of the 
50 normal cases showed more than 
three of these signs, and more than half 
had either one or none. Conversely, 
only seven of the disturbed subjects 
have as few as three of the proposed 
signs. Mean number of these signs in 
the normal group is 1.6, in the dis- 
turbed group, 5.0. This difference is 
statistically significant. 

Considering only those signs which 
occur with a statistical significance 
more frequently in the disturbed than 
in the non-disturbed group, only five 
normal subjects have more than one 
sign, while only two of the disturbed 
subjects have less than two. Of these 
signs, the mean number in the normal 
group is .6, in the disturbed group is 
3.1. This difference is statistically sig- 
nificant. 

These danger signs occur most in 
normal subjects at 7 years of age, in the 
disturbed group at 6 years. However, 
an earlier study (1) suggests that C, 
confabulation, and morbid bizarre or 
troubled content may occur at 7 years 
of age in presumably normal subjects. 


Pres roar? 


reference; 


— 


Diagnostic Validity of the Ames Danger Signals 






Thus, in conclusion, it appears that 
seven of a list of sixteen proposed dan- 
ger signs do occur to a statistically sig- 
nificant extent, more in a group of 
emotionally disturbed subjects than in 
a group of normal subjects matched 
for age, sex, I.Q. and socio-economic 
status. Four others occur just below a 
statistically significant level. 


However, it should be noted that in 
our opinion a sign list, though useful 
in detecting emotional disturbance as 
reflected in the Rorschach response, 
should not be relied on as the sole 
measure of disturbance. The total con- 
figuration of the response must always 
be considered. 
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Sex Identification at Different Ages Using the Draw-A-Person Test 
R. L. BuTLer and F. L. Marcuse 


State University of Washington 


The evidence bearing on both the 
incidence and significance of the sex 
of the first figure drawn, when Ss 
are told to draw a person, is confused 
and inconclusive. Brown and Tolor in 
a review of the literature bearing on 
this question have stated that “testing 
of a careful representative sample of 
each age group throughout early and 
later childhood is very much needed 
in order to investigate developmental 
changes or differences” (1, p. 207). 
Several studies (2, 4, 5) offer evidence 
to support the hypothesis that while 
the great majority of normal adult 
males draw their own sex first, it is 
quite common for normal adult fe- 
males to draw the opposite sex first. 
These studies would suggest that ap- 
proximately 85 per cent of males draw 
their own sex first while the corre- 
sponding figure for females is in the 
neighborhood of 60 per cent. 

The assumption is made that boys 
and girls will also show this same sex 
difference in their drawings. Such an 
assumption tends to be a priori. Ina 
study by Weider and Noller (6), it 
was reported that of 73 boys and 80 
girls, between the ages of 8 and 10, 
of normal intelligence and from upper 
middle and lower socio-economic 
groups, 74 pes cent of the boys and 97 
per cent of the girls drew self-sex fig- 
ures first. This is contrary to what oc- 
curs in adults. In another study Jolles 
(3) attempted to provide develop- 
mental information on children’s 
drawings. This investigator was con- 
cerned, among other problems, with 
the question as to which sex children 
of grade school age tend to draw first. 
For this purpose he collected draw- 
ings, using the H-T-P test, from 8,500 
children of ages 5-12, whose grade 
placement ranged from kindergarten 
to sixth grade. These children were 
located in five towns in Illinois. He 


found that of a sample of 1282 boys, 
of different ages, 85 per cent drew the 
male figure first and that of a sample 
of 1278 girls, of different ages, 80 per 
cent drew the female figure first. 
Both the findings of Weider and 
Noller (6) and the results of Jolles 
(3) suggest that school age children 
do not order the sequence of sexes as 
do adult groups. Both studies however 
pose certain —. Was the work 
of Weider and Noller representative?! 
Was their total number of Ss adequate? 
In Jolles study, did the fact that the 
classroom teacher administered the 
test have any effect? On what basis was 
the sample -of approximately thirty 
per cent (2560) selected from the orig- 
inal 8500 drawings? Did his procedure 
of combining different ages conceal 
developmental changes? The problems 
osed by such questions were the sub- 
ject matter for the present investiga- 
tion. 


PROCEDURE 


All students who were enrolled in 
the Pullman, Washington schools and 
who were present on the days when the 
drawings were obtained, served as Ss 
for this study, Drawings from 810 boys 
and 734 girls from 5 to 18 years of age 
and from kindergarten to twelfth 
grade were obtained. A total of 1444 
drawings representing 83.3 per cent of 
all pupils attending Pullman schools 
were secured (a measles epidemic had 
lowered the number somewhat). In 
addition to this figure of 1444, 100 
drawings, 49 girls and 51 boys, were 
procured from a private kindergarten.” 


2 This study was done in partial fulfillment of 
the requirements for the degree of Master 
of Science in Psychology at the State Uni- 
versity of Washington. 

*The 100 private kindergarten pupils may 
not be a representative sample. However a 
public kindergarten does not exist in Pull- 
man. 
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Sex Identification at Different Ages Using the Draw-a-Person Test 


Tas.e I—Errors of Nine Judges Asked to Rate Drawings of Five to Eight 
Year Olds as Male, Female, or Ambiguous 


Age 
Male (N=25) 
Female (N=18) 
Ambiguous 
Total Error. 
% Error 


Male (N51) 
Female (N=73) 
Ambiguous 
Total Error 
% Error 


7 Male (N=77) 
=147 Female (N=70) 
Ambiguous 
Total Error 
% Error 


8 Male (N81) 
N=146 Female (N—65) 
Ambiguous 

Total Errot....... 

, to 





5 
N43 

















6 
N—124 












































Ss were tested in a group with the 
teacher out of the room. Each class, 
consisting of capers 25 Ss, was 
given ten minutes to complete a single 
drawing. The following instructions,’ 
the same for Ss at all ages, were read 
to each group by the E. “On the sheet 
of paper you now have, I want you to 
draw somebody. You may draw the 
person any way you want. I am not in- 
terested in how well you draw, I just 
want to know how you would try to 
draw somebody. Please do this your- 
self. Do not look at anybody else’s 
paper. Go ahead and draw one per- 
son.” All questions were answered with 
“You may do it any way you like—do 
whatever you think best.” At the end 
of the ten minute period the E col- 
lected each drawing and at the kinder- 
garten and elementary levels asked 
each S the sex of the figure drawn. In 
addition, the age (to the nearest year) , 
grade, and sex of the S were also re- 
corded by the E. At the junior high 
level and up, the above procedure was 
slightly changed and Ss were told, at 
the end of ten minutes, to record this 
same information on the back of their 
drawing. 

At each age the percentage of males 


Judge 
‘+ 442 6 2 2 8 
:b+ © tt © & 2 #4 
4 9 0 2 5 0 4 2 2 
17 8 29 11 17 2 12 18 16 
22 18 29 16 22 28 18 24 20 
52 42 67 37 52 65 42 56 47 
6 7 
2 3 
13° «11 
21° 21 
17, (17 
2 
7 
9 


_ 
eo 
_ 


or 
OOogewcoc CNs1— 


drawing males, males drawing females, 
females drawing females, and females 
drawing males was determined. Tables 
and a curve were then derived from 
these data in order to provide norma- 
tive figures and to see whether there 
was an age range at which the sexes 
diverged in the tendency to draw self- 
sex figures in response to the direc- 
tions, draw a person. 

Jolles (3) had reported that only 
one per cent of the drawings of five 
year olds were ambiguous as to sex, 
but pilot data® had cast doubt on this 
statement. Consequently nine judges 
were given the drawings of the five 
year olds with the following instruc- 
tions: “Please separate these drawings 
into a male pile and a female pile. If 
the sex of a figure is ambiguous or in- 
determinate to you, place it in a third 
pile.” Two of the judges also did this 
for ages six, seven and eight. The age 
level involved was unknown to the 





* A pilot study was made in a nearby Colton, 
Washington school to determine whether 
younger Ss would have any difficulty under- 
standing the instructions and whether the 
time period (10 minutes) alloted was ade- 
= Both were found to be satisfactory. 

nly 1 of 1545 drawings had to be discarded. 
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judges and the two judges who made 
evaluations for ages five through eight 
were presented the drawings of the dif- 
ferent age groups in random order. 


RESULTS 


Table I shows the errors of the nine 
judges who were asked to rate the 
drawings of the five year olds as male, 
female, or ambiguous (indeterminate). 
Since the sex of the drawn figure was 
known, judgments of male when the 
figure drawn was intended to be a fe- 
male, female when the figure drawn 
was intended to be a male, or the 
judgment of ambiguous were all con- 
sidered errors. For example at age five, 
judge four (Table I) had designated 3 
of the male drawings as female, 2 of 
the female drawings as male, 11 of the 
drawings as ambiguous and thus had 
a total of 16 errors (37%) . Judge three 
found 29 of the five year old drawings 
to be ambiguous and thus made a total 
of 29 errors (67%) . The mean percent- 
ages of error of the nine judges in this 
study, for the drawings of the five year 
old Ss, was 51. This is not in accord 
with the one percent figure for this age 
reported by Jolles (3). The results of 
the two judges who also classified the 
drawings of the six, seven and eight 
year olds, showed a decrease in the 
number of errors as the age of the Ss 
increased, although even at age eight 
the sex of the figure drawn was not 
always obvious. 7 


TABLE II—Sex of First Figure Drawn 
by 810 Males 5 to 18 Years of Age 
Male Female 

Age 

No. Percent No. Percent 
5 (N=26) 18 692 8 308 
6 (N=51) 38 74.5 13 25.5 
7 (N=77) 59 76.6 18 23.4 
8 (N81) 71 87.7 10 12.3 
9(N—81) 70 86.4 1 13.6 


10 (N=82) 7 1S 7 85 
11 (N—73) 70 959 3 4.1 
12 (N—66) 560 8490s 
13 (N—79) 75 949 4 5.1 
14 (N—51) 49 96.1 2 3.9 
15 (N—48) 47 979 1 2.1 
16 (N—=39) 38 («974 1 26 
17 (N—39) 38 («974 1 2.6 
18 (N=17) 16 94.1 1 5.9 
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Tables II and III show the distribu- 
tion of drawings, according to the sex 
of the figure drawn, for males and fe- 
males of 5 to 18 years of age. These 
facts and the fact that males and fe- 
males tend to diverge at a certain age 
range on the drawing of self-sex figures 
are shown in Figure 1. A two-tailed 
test of significance showed that the 
probability was .002 that the differ- 
ences obtained in the two curves of 
Figure 1, would occur by chance. 


DISCUSSION 


The data reported show the impor- 
tance of having young Ss, especially 
below the age of eight, designate the 
sex of the figure drawn. Results show 
that the number of incorrect judg- 
ments concerning the sex of the figure 
drawn decreases as the age of a S in- 
creases. 

In order to reduce the number of 
uncontrolled factors, the E, instruc- 
tions, materials, and length of testing 
period were kept constant for all Ss. 
The findings of this study, with Ss 
above eight years of age, would sup- 
port the results obtained with adult 
groups, namely that while both sexes 
tend to draw their own sex first on the 
DAP, males tend to draw significantly 
more self-sex figures than females. Be- 
low the age of eight, the tendency to 
draw self-sex or opposite-sex figures 
does not differentiate between boys 
and girls. This study then may be said 


TaBLeE III—Sex of First Figure Drawn 
by 734 Females 5 to 18 Years of Age 


Male Female 
Age 
No. Percent No. Percent 
5 (N=17) 7 412 10 58.8 
6 (N=73) 9 12.3 64 87.7 


7 (N—70) 18 2.7 52 743 

8 (N—65) 15 231 50 769 

9 (N—74) 21 «4284 «= 58—s«S72*G 
10 (N—76) 2 203 %56 73.7 
11 (N=70) 16 29 54 & 77.1 
12 (N—51) 14 275 387 725 
13 (N—77) 23 «69960C—i4ACO704 
14 (N—42) 11 062 $81 738 
15 (N—49) 12 245 = 387s 75S 
16 (N—36) 12 333 24 66.7 
17 (N=23) 13 565 10 435 
18 (N=11) 7 63.6 4 364 





302 


100 
90F 
sor 
TOF 


60} 








Sex Identification at Different Ages Using the Draw-a-Person Test 
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to yield results which both support and 
refute the idea that children behave 
like adults in their response to instruc- 
tions for the DAP test. Whether the 
performance of children on the DAP 
is similar to or different from the per- 
formance of adults depends on age. 
The over-all results are interpreted as 
being related to the “dominance” (or 
incipient awareness of the 1 agra 
of the male sex in the American cul- 
ture as reflected in its attitudes, mores, 
customs, and institutions. 

The present findings do not, as may 
be thought, have any implications for 
the latency period. According to 
Freud, the S in this period may show 
either reaction-formation or sublima- 
tion, in which case he might draw 
either the opposite or the self-sex fig- 
ure. Therefore the assumption of a 
latency period is not open to scientific 
testing. 


SUMMARY 


The purpose of this study was to 
ascertain whether children, adoles- 
cents, and young adults, respond to 
the DAP as do adult groups. Some 810 
boys and 734 girls between the ages of 
5 and 18 were used as Ss. The results 
indicate that boys and girls above the 





age of eight do respond to the DAP 
like adult groups, but below the age 
of eight the tendency to draw self-sex 
or opposite-sex figures appears equally 
as strong in girls as in boys. The re- 
sults are interpreted as having no bear- 
ing on the latency period but as re- 
flecting a growing awareness among fe- 
males of the dominance of the male sex 
in contemporary western civilization. 
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The Effect of Examiner Status Upon Rorschach Performance ' 


FRANCES A. CAMPBELL AND PAUL B. FIDDLEMAN 
University of North Carolina 


Several studies and theoretical an- 
alyses (2, 3, 7, 9, 10, 11, 12, 18, 14) 
indicate that situational variables may 
have an important influence upon 
Rorschach performance. The status of 
the examiner is one such variable not 
yet studied experimentally. The pres- 
ent research is directed primarily to- 
ward an experimental analysis of the 
influence of E’s status, as perceived by 
S, upon S’s Rorschach performance. 


With the idea that more meaning- 
ful results might be obtained if the 
subjects were pre-selected on the basis 
of some psychologically relevant vari- 
able, the differential effects of need- 
achievement (hereafter referred to as 
n-ach) upon Rorschach performance 
under various status conditions were 
investigated. The n-ach scale of the 
Edwards Personal Preference Sched- 
ule (4) (hereafter referred to as 
EPPS) was used to measure this per- 
sonality dimension. 

An E’s status might affect an S’s 
Rorschach responses in a variety of 
ways. For exploratory work three 
measures were chosen as dependent 
variables most likely to be affected: 
the number of responses given to the 
cards, the number of whole responses 
given, and the range of content or 
number of content categories used 
(hereafter referred to as R, W, and 
NCC, ne . Three predictions 
were made 


1. R, W, and NCC scores will be 
greater under high status conditions 





* This work was completed as part of the re- 
quirements for the M. A. degree. Data col- 
lection was facilitated by support provided 
by a grant from the United States Public 
Health Service (Grant M-1027, E. Earl 
Baughman, principal investigator). The 
authors are indebted to Dr. Baughman, who 
directed the work, and to Drs. John J. Mel- 


linger, Samuel L. Guskin, W. Grant Dahl- 
strom and Shephard Liverant. 


than under low status conditions. 


2. R, W, and NCC scores will be 
greater for Ss with high scores on n- 
ach than for Ss with low scores on 
n-ach. 


3. R, W, and NCC scores will be 
greatest under high status conditions 
with high n-ach Ss and smallest under 


low status conditions with low n-ach 
Ss. 


PROCEDURE 


Experimental Design 


The experimental design is shown 
in Table I. Each of two Es tested 


TABLE I—Experimental Design 
High n-ach Lown-ach 
El E2 El £2 


High status 7Ss 7Ss 78s 7Ss 
Undefined status 7Ss 7Ss 7Ss_ 7Ss 
Low status 7Ss_ 7Ss 78s 7Ss 


equal numbers of Ss under each of the 
status sets, and their Ss were equally 
divided between high and low n- 
achievers. 


Subjects 


Ss were male undergraduates at the 
University of North Carolina, naive 
with respect to the Rorschach. They 
were selected from a pool of 430 Ss 
to whom the EPPS had been admin- 
istered. The full range of n-ach scores 
(0-26) was represented in the group, 
with a mean of 15.1 and a S. D. of 
4.0. Cutting scores (established at 10 
and 20) included the upper 19% and 
lower 18% of the group, giving 157 
persons from whom the 84 Ss were 
randomly drawn and assigned to the 
experimental groups. 


Examiners 


The examiners were a male and a 
female graduate student, the same age, 
with nearly identical training and ex- 
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perience in the use of the Rorschach. 


Status Sets 


The status sets were induced pri- 
marily through the use of short type- 
written paragraphs describing the pur- 
pose of the study and the persons con- 
ducting it. These paragraphs were 
identical for the three status sets, ex- 
cept for the way in which E and the 
purpose of the study were defined. In 
the high status set E was defined as a 
research scientist, in the low status set 
as a student working on a term paper, 
and in the undefined status set Es were 
referred to as “individuals.” The Es 
dressed in a manner appropriate to 
the status set: white lab coats for high 
status, casual street dress for undefined 
status, and, in the case of the female E, 
“bobby socks” for low status. Es also 
introduced themselves as Dr., Mr., or 
using their first name according to the 
set. Otherwise, standard procedures 
were maintained throughout, using 
bare offices. 


Testing Procedure 


The testing procedure included 
standard greeting and introduction of 
E, reading of the status setting para- 
graph by S while E left the room 
briefly, then the administration of the 
Rorschach. Beck’s (1) suggested in- 
structions were used, with S seated 
with his back to E. Encouragement was 
given if S rejected or gave only one 
response to Card I, never after that. 
Finally, Form A of the Thurstone Test 
of Mental Alertness (15) (hereafter 
referred to as TMA) was given to each 
S to check for possible differences in 
intellectual ability between the 
groups. 

Scoring 


The Es scored their protocols inde- 
pendently for productivity, location, 
and content. Scoring reliability was 
determined by having each E re-score 
twenty of the other E’s records and 
computing reliability coefficients 
(Pearson product moment) for each 
category used. These r’s ranged from 
+.97 to +.99. 


The Effect of Examiner Status Upon Rorschach Performance 


RESULTS 


There were no significant differ- 
ences between the groups with respect 
to intelligence. The relationships be- 
tween the dependent variables (R, W, 
NCC) and intelligence were also an- 
alyzed, but no significant correlations 
were found. 

The means and S. D.’s for R, W, 
and NCC scores obtained under each 
experimental condition are shown in 
Table II. The results of the analysis 
of variance for R are presented in 
Table III. 

The analysis of variance for R shows 
that the number of responses given by 
the Ss varied as a function of the status 
setting used, while n-ach and the Es 
did not significantly affect the vari- 
ance, T-tests revealed differences be- 
tween the mean R in High and Unde- 
fined and between High and Low set- 
tings significant at the .01 level, but 
no difference between Undefined and 
Low settings. Ss gave significantly more 
responses under High status conditions 
than they did under the other two 
sets. 

Correlation-coefficients revealed that 
little of the variance in W could be 
predicted from R, therefore no control 
for R was necessitated. A logarithmic 
transformation of W scores was made 
since the variance in one cell exceeded 
its mean. Analysis of variance then 
showed that none of the independent 
variables significantly influenced W. 

The high correlation between R and 
NCC (average Pearson product mo- 
ment r = +.79) made the use of an- 
alysis of co-variance necessary in order 
to show how NCC varied independent- 
ly of R. Two of the F ratios were sig- 
nificant: the interaction between n-ach 
and Es (p = .05) and the interaction 
between status and the Es (p = .05). 
No one of the independent variables 
alone, however, had a statistically sig- 
nificant influence upon NCC. 


DISCUSSION 


The major finding of this research 
is that the way in which E is defined 
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TABLE Il—Means and Standard Deviations for R, W, and NCC Obtained 
by Two Es from High and Low n-achievers under High, 
Undefined, and Low Status Settings 


Rorschach Status 


High and Low n-ach 


Score Setting High n-ach Low n-ach Combined 
El E2 BothEs El E2BothEs El E2 BothEs 
(N= (N= (N= (N= (N= (N= (N= (N= (N= 
7) 7) 14) 7) 7) =14) 14) 14) 28) 
R High ‘4 39.9 29.0 344 37.3 43.1 402 38.6 36.1 37.3 
S.D. 18.4 144 16.7 13.0 21.0 17.1 15.3 188 169 
Undefined = 274 31.3 294 21.3 26.7 240 244 290 26.7 
S.D. 12.0 13.0 122 88 208 15.6 10.6 168 14.0 
Low = 249 25.0 249 36.7 23.1 299 308 24.1 274 
S.D. 60 170 123 205 11.7 175 15.8 14.1 15.1 
All sets x 30.7 284 296 318 31.0 314 312 29.7 30.5 
S.D. 14.1 144 140 16.1 19.6 17.7 14.0 17.0 159 
Ww High x 13.3 96 114 94 96 95 114 96 105 
S.D. 168 3.3 11.8 5.7 5.1 52 123 41 9.0 
Undefined E 3 8.7 106 9.7 90 8.1 86 89 94 9.1 
S.D. iS 749 85 46 43 4.3 34 62 49 
Low x. 94 101 98 121 74 98 108 88 98 
S.D. 28 34 3.0 68 3.5 5.7 52 36 4.5 
All sets Ba 10.5 10.1 10.3 102 84 93 104 92 98 
S.D. 96 50 76 57 642 «6550 78 47 64 
NCC High =x 11.3 8.7 10.0 12.7. 9.1 10.9 12.0 89 10.5 
S:D. 82 40 3.7 45 42 4.6 $8 39 4.1 
Undefined x= 9.3 11.7 104 80 86 83 8.6 10.1 9.4 
S.D. $5 38 3.7 26 5.7 43 29 49 4.1 
Low ‘4 81 79 80 119 76 97 100 7.7 89 
S.D. ig 82 25 39 28 3.9 32 29 83 
All sets x 95 94 95 10.9 84 96 102 89 96 
S.D. 3.0 39 5.0 41 42 4.3 36 41 3.9 
TABLE III—Analysis of Variance for R 
Source df Variance Estimate F P 
Status 2 987.94 4.15 Ol<p<.05 
n-ach 1 68.76 <i 
Examiners 1 48.76 <i 
Interaction 
status x n-ach 2 541.45 2.27 j10<p<.20 
Interaction 
status x Es 2 230.73 <l 
Interaction 
n-ach x Es 1 12.19 <i 
Interaction 
status x n-ach x Es 2 405.08 1.70 1<p<.20 
Error 72 238.14 
All sources 83 21,064.95 


to S may affect the number of re- 
sponses that S gives to the Rorschach 
blots. Thus the significance of the in- 
terpersonal context within which the 
testing takes place is underscored. A 
possible source of variation in results 
from one research report to another 


is also suggested. Since R is signifi- 
cantly related to most other Rorschach 
variables (5, 6), the implications of 
this finding are even more far reach- 
ing. There is in addition the sugges- 
tion that certain interactional effects 
involving the examiner may influence 
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the range of content that S produces. 

Interpersonal theory suggests that 
an individual’s performance is not un- 
related to the human situation with- 
in which it occurs. Also it seems likely 
that most Ss will want to do their best 
when working for or with a person of 
some significance. The nature of the 
Rorschach affords § no certain means 
of evaluating his performance, but the 
amount or quantity of response is oft- 
entimes, in our culture, one means of 
estimating how well one has done. The 
Ss in this research reacted to a person 
with high status by increasing the num- 
ber of responses emitted. 

There were no differences in pro- 
ductivity, however, between the unde- 
fined and low status sets. Numerous 
remarks by Ss in the undefined con- 
text suggested that they perceived E to 
be another student collecting thesis 
data, hence the similarity of results 
under the two conditions may reflect 
only their subjective identity to most 
Ss. It appears to make no difference 
whether E is specifically defined as a 
student or whether the situation is 
handled in such a way as to make this 
the natural interpretation, without the 
words being spoken. 

The finding that Ss who differ in 
n-ach do not differ in R was unex- 
pected. Before this finding can be in- 
terpreted more information concern- 
ing the validity of the EPPS scale as a 
measure of n-ach will have to be gath- 
ered. To some extent, the results of 
this research question the construct 
validity of this scale. 

The failure of W and NCC to reflect 
changes in E’s status might best be at- 
tributed to the variables themselves. 
Others have found W to be a rela- 
tively stable score despite attempted 
experimental manipulations (8). As 
for NCC, a mere counting of content 
categories may have obscured qualita- 
tive changes in content which did per- 
haps occur. 


CONCLUSION 
The Rorschach was administered to 
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84 Ss (high and low n-achievers) by 
two Es under three status sets. The Ss 
gave significantly more responses when 
E was defined as having high status 
than when his status was undefined or 
when he was defined as having status 
equal to that of S. Status did not affect 
the number of content categories used, 
however, nor the number of W re- 
sponses. No significant effects due to 
n-ach or Es were found. 
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The Perceptual Organization TAT Score: 
Number, Order, and Frequency of Components 


RicHARD H. DANA 
University of Nevada 


Concurrent validation studies of the 
TAT have proposed three simple, ob- 
jective scores which employ stimulus- 
properties of the cards (3, 5, 6, 7). 
These scoring categories, (a) Percep- 
tual Organization (PO), (b) Percep- 
tual Range (PR), (c) Perceptual Per- 
sonalization (PP), were developed 
from a rationale for objective scoring 
(2) and a statement of personality 
theory (1) . Two scoring categories, PO 
and PR, had approximately equivalent 

redictive efficiency. As a consequence 
urther research will focus on separate 
categories. 


PO DEVELOPMENT AND VALIDATION 


PO reflects one basic aspect of test 
behavior, the approach to the situa- 
tion. PO measures the S’s ability to 
follow the directions to “tell a story”. 
There are seven components which 
epitomize the usual directions: (a) 
card description, (b) present behavior, 
(c) past events, (d) future events, (e) 
feeling, (f) thought, (g) outcome. 

The 390 Ss used in the development 
of PO and for the present studies were 
equally drawn from rigorously defined 
normal, neurotic, and psychotic popu- 
lations, 240 males and 150 females. 

Two kinds of scorer reliability were 
determined, Scoring category reliabil- 
ity with randomly drawn male and fe- 
male samples, from original and cross- 
validation groups, was 93 and 94 per 
cent of agreement. The reliability of 
scoring PO components ranged from 
89 to 97 per cent. There were no differ- 
ences in scoring by psychologists or 
clerks. 

Concurrent validation studies yield- 
ed 7 ame omer scores which successfully 
differentiated between 68 and 83 per 
cent of Ss in each group from other 
groups. Sex differences are minimal. 


Length of TAT story and intelligence 
of S are not significant variables. PO 
thus has considerable value, at least for 
prediction of traditional clinical labels. 
In a sense PO scores provide an index 
of degrees of psychiatric illness. Details 
are contained in a manual summar- 
izing development and validation re- 
search (8). 


PROBLEM 


Previous research has been propa- 
deutic to validation of PO with predic- 
tions of personality characteristics as 
external criteria. These predictions 
have primary importance in the at- 
tempt to increase the probability of 
“correct” clinical statements from 
TAT data. Pilot research using the 
order and frequency of PO compo- 
nents pe relationships to per- 
sonality variables and to psychopa- 
thology. Prior to any examination of 
specific hypotheses, it was necessary to 
evaluate the stimulus-value of the 
TAT cards with respect to number, 
order, and frequency of PO com 
nents elicited. The salient question 
with respect to PO is: How similar are 
the stimulus-values of the TAT cards 
for number, order, and frequency of 
PO components for normal and clini- 
cal Ss? 


METHOD AND RESULTS 


PO stimulus-value of TAT cards: The 
numbers of PO components elicited by 
each card are roughly equivalent. Evi- 
dence for this accrues fate product- 
moment correlations between PO 


scores for each card with PO scores for 
all cards. A mean correlation (z trans- 
formation) of .70 was obtained with a 
range from .60 (card 16) to .78 (card 
14) . Since a short-form TAT set is used 
throughout (4), only those pertinent 
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correlations will be reported: Card 
2—.68; card 3GF=.76; card 4=.69; 
card 6GF=.64; card 7GF=.74 (N=67 
normal, female college Ss). Table I 
shows the PO intercorrelations for 
these five TAT cards. 


TABLE I—PO Product-Moment Inter- 
Correlations for 67 Normal Female 
Ss on Five Cards 


Card 3 4 6 7 
2 62 40 50 ao 
3 61 EY | 58 
4 58 57 
6 60 


Note: all correlations are significant at 
<.001 level. 


There was a .91 correlation between 
the mean PO score for the five cards 
and the mean PO score derived from 
all cards. Thus, each card elicits PO 
scores which are close approximations 
of (a) the mean PO score for all twenty 
cards, (b) scores of the other four cards 
in the short-form set (for females) . 


Order of PO components: The order 
of PO components does not differ 
either from card to card or from group 
to group. Table II presents intercorre- 
lations for mean order with normal fe- 
male Ss. Nine of ten rho correlations 


TABLE I]—Rank-difference Inter- 
Correlations of the Mean Order of PO 
Components for 67 Normal Female Ss. 


Card 3 4 6 7 
2 89 .79* 89 .96 
3 .96 1.00 96 
4 .96 89 
6 96 


Note: all correlations are significant at 
<.01 level except * which is significant at the 
<.05 level. 


TasLe IJJ—Rank-difference Corre- 
lations Between Mean Orders of 
Occurrence of PO Components for 
Normal, Neurotic, and Psychotic 


Groups 
Group Neurotic Psychotic 
Normal 89 86 
Neurotic .96 


Note: all correlations are significant at 
<.01 level. 
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are significant at the <.01 level. Table 
III presents similar comparisons be- 
tween mean orders of occurrence for 
PO components of normal and clini- 
cal groups. 

There is a significant relationship be- 
tween order of occurrence for all three 
groups. The order of PO components, 
irrespective of diagnostic group, pro- 
ceeds from card description to either 
present behavior or past events, then 
to feeling, thought, future events and 
outcome. 


Frequency of PO components: Table 
IV presents rho correlations between 
frequencies of each PO component for 
the three groups. 


TasLe IV—Rank-difference Corre- 
lations Between Frequencies of Each 
PO Component For Three Groups 


Group Neurotic Psychotic 
Normal 54 50 
Neurotic .90* 


* significant at <.01 level. 


The frequencies of occurrence for PO 
components are not significantly re- 
lated for normal and clinical groups. 
The neurotic and psychotic groups, 
however, have highly similar frequen- 
cies. 


TasBLeE V—Rank-difference Inter- 
Correlations Between Cards for 
Frequency of PO Components 
in Each Group 


Card 
Group 3 +4 6 7 
Normal 
j A2 53 58 
3 60 21 58 
4 88** 85** 
6 .78* 
Neurotic 
29 87% .84* .98** 
3 A9 60 38 
4 .96** 81* 
6 89** 
Psychotic 
2 44 71¢ .86** 93** 
3 .76* 53 38 
4 85** .76* 
6 .96** 


* significant at <.05 level. 
** significant at <.01 level. 
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Rank-difference correlations were 
computed between cards and between 
groups for frequency (Tables V, VI) . 
Inspection of Table V indicates that 
frequencies for some cards are signifi- 
cantly related; frequencies for some 
cards are not significantly related. 
With ten intercorrelations for each 
group, three for normals, six for neu- 
rotics, and seven for psychotics are sig- 
nificant. Thus, the PO stimulus-value, 
with respect to frequency, is much 
more similar for clinical groups than 
for a normal group. In other words, 
psychopathology effectively reduces 
the objective differences between TAT 
cards. When intercorrelations between 
groups for frequency on each card are 
inspected, the simple inference of dif- 
ferences between normal and clinical 
groups is complicated (Table VI). It 
appears that some cards for some 
groups have similar frequencies. Al- 
though differences between groups do 
obtain when frequencies of scores for 
all five cards are used, accurate diag- 
nosis cannot be made on this basis. It is 
congruent with these results to assume 
that personality dimensions may be 
distinct from diagnostic categories and 
that discrepancies in results may be 
attributable to this. A general hypo- 
thesis accruing from this analysis is 


TABLE VI—Rank-difference Inter- 
Correlations Between Groups for 
Frequency of PO Components 


on Each Card 
roup 

Card 2 Neurotic Psychotic 

Normal .76* 58 

Neurotic 81* 
Card 3GF 

Normal 16 38 

Neurotic 87** 
Card 4 

Normal .83* 71 

Neurotic .96%* 
Card 6GF 

Normal .92** 79* 

Neurotic 85%° 
Card 7GF 

Normal A2 35 

Neurotic 96** 


* significant at <.05 level. 
** significant at <.01 level. 
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that frequency of occurrence of PO 
components is related to personality 
data. 


SUMMARY AND CONCLUSIONS 


1. All TAT cards are approximately 
equivalent stimuli. It makes no differ- 
ence which cards are used. 


2. The number of TAT cards used 
is important with five cards providing 
data equivalent to that from the entire 
set. 


3. The order of PO components is 
not related to the specific TAT card or 
to diagnostic group. A standard order 
is preserved for all groups on all cards. 


4. Frequency of occurrence of PO 
components is not related to card 
stimulus-value per se but to diagnosis 
and hypothetically to personality vari- 
ables. 


These results suggest the need for 
(a) large scale correlational attempts 
to isolate personality variables related 
to specific PO components, (b) testing 
of well defined groups which are high 
and low on previously isolated person- 
ality variables to determine the 
strength of relationships to PO com- 
ponents, (c) the development of spe- 
cific hypotheses from the above results 
which predict the direction and magni- 
tude of relationships between PO com- 
ponents and independent external cri- 
teria. This would constitute cross-vali- 
dation. Research is in process on the 
first two aspects of this proposed frame- 
work for PO validation. 
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There has been a trend in Rorschach 
interpretation recently toward more 
emphasis on content analysis, generally 
based on psychoanalytic theory. One of 
the early references to content analysis 
appears in Bochner and Halpern (2). 
In discussing cards IV and VII these 
authors write: “Card IV — the heavy 
male figure may suggest the father or 
authority in general. This may be 

leasant or unpleasant. It’s dark qual- 
ity and overwhelming characteristics 
are particularly disturbing to those for 
whom parental authority is still an un- 
resolved problem. Card VII — the two 
female faces and even female figures 
(in reverse position dancing girls) , as 
well as the generally soft light quality, 
frequently with maternal implica- 
tions.” Their hypotheses concerning 
cards four and seven are based on clini- 
cal experience and they present no fur- 
ther evidence in support of their hy 
theses, Since the original introduction 
of the idea that cards IV and VII rep- 
resent the father and mother figures 
respectively, many clinicians interpret 
responses to these cards as attitudes felt 
toward the parent figures. 

The first systematic investigation to 
determine the extent to which the 
properties of cards IV and VII actually 
do elicit responses relative to “father” 
and “mother” was done by Meer and 
Singer (8) . The hypothesis they tested 
was that normal adults will designate 
card IV as the “father card” and card 
VII as the “mother card” more fre- 
} aay than would be attributed to 
chance. The subjects they used were 50 
male college students. While their re- 
sults were statistically significant, in 
actual numbers only 12 of the 50 sub- 
jects tested chose card IV as the “father 
card,” and only ten chose card VII as 
the “mother card.” 


Rosen (11) repeated the procedures 








The Relationship Between Rorschach Responses and Attitudes 
Toward Parents 


CYNTHIA ENGEL 


of Meer and Singer, using as subjects 
180 university psychology students, 
and obtained comparable results. 
Charen’s study (3) is the most recent 
in this area. He administered the Ror- 
schach, and at the conclusion asked his 
subjects to “pick out the card which is 
most like your father,” and “pick out 
the card that is most like your mother,” 
and they were asked the reasons for 
their choices. With but few exceptions 
the questions were interpreted by the 
patients as meaning “which response 
that you gave to the Rorschach re- 
minds you of your parent.’’ Charen 
tested over 50 adults in this manner. 
Both sexes were tested; all were white 
and from various socio-economic class- 
es except relief groups. His conclusion 
is that there is a tendency to “use all 
cards in such a manner that no distinc- 
tion between cards IV and VII and the 
other eight cards could be made.” 
Thus, according to Charen, if card 
VII is always presumed to represent the 
mother figure and card IV the father 
figure for all subjects, gross errors 
would result in a percentage of cases. 


PROBLEM 


Since the previously cited studies 
have been done with adults, it was felt 
that it would be of interest if a similar 
study were done with children. In ad- 
dition to determining the cards chosen 
by children as “mother cards” and 
“father cards”, certain other problems 
were also undertaken. The questions 
investigated in the present study were: 

First, do children choose card IV as 
the “father card” and card VII as the 
“mother card” more frequently than 
they choose any other cards. 

Second, since not all children choose 
cards IV and VII, is their attitude 
toward card VII the same as toward the 
card chosen as the “mother card”, and 
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is their attitude, whether positive or 
negative, toward card IV the same as 


their attitude toward the card chosen 
as the “father card.” If the attitudes 
toward the cards chosen as_ the 
“mother” and “father” cards and 
toward cards IV and VII are respective- 
ly the same, then if a card other than 
IV and VII are chosen there might still 
be some justification for interpreting 
responses to cards IV and VII as rele- 
vant to attitudes to mother and father. 


SUBJECTS 


The 30 subjects used in this study 
were volunteers recruited from the 
author’s classes and classes of her col- 
leagues.* All subjects were boys be- 
tween the ages of 11 and 12. They 
varied in school grade from five to 
seven. The children were all from mid- 
dle and upper middle class homes and 
of the Jewish religion. 


PROCEDURE 


Each subject was administered the 
Rorschach according to the method as 
outlined by Klopfer (6). At the com- 
pletion of the standard Rorschach test 
the subject was asked to choose two 
cards: “One that in your imagination 
makes you think of your father, and 
one that in your imagination makes 
you think of your mother.” 

After choosing the two cards the 
child was asked to explain why he 
chose them. All the subjects were able 
to choose two cards and explain the 
reasons for their choice. 

All the Rorschach responses were 
scored as indicating positive or nega- 
tive attitudes. If it was not clear during 
the administration whether a response 
was positive or negative, then during 
the “Testing of the limits” the subject 
was questioned and encouraged to talk 





The author wishes to thank the officials of 
the East-Rockaway Jewish Center for per- 
mission to conduct the study, and her col- 
leagues for their help in recruiting the boys 
who participated. 

? Based on a pretest which had been under- 
taken previously, this was found to be the 
most easily understood form of posing the 
question to the boys. 
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a little more about the animal or leaf 
or whatever else was the subject of his 
response. As a result each response be- 
came scorable as either positive or neg- 
ative. 


RESULTS 


In Table I we have presented the 
frequency with which each card was 
chosen as a “father card” and “mother 
card” by the 30 children who partici- 
pated. It is apparent that there is a 
great deal of variety in the children’s 
choices. Although there is no strong 
trend toward choosing cards IV and 
VII respectively as the “father” and 
“mother” cards, card III was chosen 
a great deal. The two most frequent 
choices for a “mother card’”’ were 
cards III and VII. The two most 
frequent choices for a “father card” 
were cards III and IV. Since card III 
is the most potent human figure in 
the set of cards, is there a different 
set of identification, diffuse and undif- 
ferentiated, in the pre-adolescent as 
contrasted with the adult? 

A comparison was made between the 
attitudes expressed toward card IV and 
those cards chosen as the “father card” 
and the attitudes expressed toward 
card VII and those cards chosen as the 
“mother card.” It was felt that if those 
attitudes are similar, then there would 
be justification for this type of inter- 
pretation as “mother card” and “father 
card.” A comparison was also made be- 
tween the cards chosen as mother and 
father cards and card X as a control 
card. The agreement between card VII 


TABLE I—Choice of Mother and 
Father Cards 


Number of Number of 

times chosen times chosen 

Card No. as mother card as father card 
1 y 2 
2 2 0 
3 8 6 
4 1 6 
5 0 z 
6 4 3 
7 6 2 
8 2 1 
9 + 4 
10 1 4 
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and the chosen mother card is 56.6%, 
and the agreement between card X and 
the chosen “mother card” is 40%; the 
difference between these two propor- 
tions is not significant at the 5% level. 
Agreement between card IV and the 
chosen “father card” is 50%, and the 
agreement between card X as the con- 
trol and the chosen “father card” is 
50%. Thus there is no greater agree- 
ment between the attitudes expressed 
toward card VII and those expressed 
toward the chosen “mother card” than 
those expressed toward card X and the 
chosen “mother card.” 

Neither is there any greater agree- 
ment between the attitudes expressed 
toward card IV and the chosen “father 
card” than there is between card X and 
the chosen “father card.” Since the 
cards children choose as mother and 
father cards vary greatly, and there is 
no greater agreement between the atti- 
tudes expressed toward card IV and 
VII and the attitudes expressed toward 
the chosen “father” and “mother” 
cards respectively, and the attitudes 
expressed toward card X as the control 
card, there seems to be no justification 
for using cards IV and VII as the 
“father card” and “mother card” for 
purposes of interpretation. 

To test whether the different Ror- 
schach cards actually do elicit different 
attitudes depending on stimulus mate- 
rial, a correlation was computed be- 
tween the percent of responses to cards 
1, 3,5, 7, and 9 which were positive and 
the percent of positive responses to 
cards 2, 4, 6, 8, and 10. The correlation 
is .20, not significant at the 5% level. 
This indicates that the Rorschach 
cards do elicit different attitudes, but 
these differences in attitudes do not in 
any way lend support to the theory that 
certain cards elicit responses that in 
some way are related to the mother and 
father figures. 

Thus the two questions posed at the 
outset of this study have been an- 
swered. These findings fail completely 
to support the theory that in terms of 
content analysis, card IV is the “father 
card” and card VII is the “mother 
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card.” The results of this study togeth- 
er with the findings of Charen clearly 
indicate that to interpret blindly reac- 
tions to card IV as reactions to the 
father figure and reactions to card VII 
as reactions to the mother figure may 
lead to gross errors of interpretation. 
However one interesting point was 
raised as a result of the children’s 
choices of mother and father cards. 
The boys picked card III frequently 
for both the mother and father card. 
This suggests that the identification 
may be different for these pre-adoles- 
cents than it is for adults. To follow 
this through, the author is now testing 
girls between the ages of nine and 
eleven to see whether they also exhibit 
this diffuse undifferentiated process of 
identification by choosing card III fre- 
quently as the father and mother card. 


SUMMARY 


The study was undertaken to see 
whether children choose cards IV and 
VII as the “father” and “mother” 
cards, and since not all children will 
choose these cards, are their attitudes 
towards the cards chosen as “father” 
and “mother” cards similar to their 
attitudes towards cards IV and VII. 

The results obtained clearly indicate 
that when children choose a “mother” 
and “father” card there is little differ- 
ence between cards IV and VII and the 
remaining eight cards in regard to fre- 
quency of choice. This study provides 
no support for the theory that cards IV 
and VII usually represent the father 
and mother figures respectively. Inter- 

retations based on this theory will be 
in error a sizeable proportion of the 
time. It was found, however, that card 
III was chosen frequently for both the 
“mother” and “father” card. The au- 
thor is now doing a follow up to see 
whether girls between the ages of nine 
and eleven will exhibit this reaction 
too. 
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Types of Pre-Cognitive Defensive Organization in a 
Tachistoscopic Experiment 


UF KRAGH 
Stockholm, Sweden 


INTRODUCTION 


The starting-point for the experi- 
ments reported here has been the 
theory of defense mechanisms formu- 
lated by Sigmund and Anna Freud and 
others, and the actual-genetic model of 
perception-personality (6, 7, 8, 13, 
15). S. Freud (4) has given a scheme 
for the dynamics underlying defense 
mechanisms. Very roughly this scheme 
comprises three links: threat/danger, 
anxiety, and defense against anxiety. 
We have transformed this scheme into 
experimental procedure, equally en- 
compassing three links: subliminal per- 
ceptual threat, (subliminal) anxiety, 
and defensive response. 

The actual-genetic model of percep- 
tion-personality has been conceived in 
close interaction with an experimen- 
tal technique for the study of pre-cog- 
nitive organization and of the develop- 
ment of percepts (6, 7, 8 and 12, 13, 14, 
15; cf. also 2 and 16) . We are not enter- 
ing in detail upon the theoretical foun- 
dations or the various technical as- 
pects; they have been laid down else- 
where. Though the experiments on 
“defensive pre-cognitive organization” 
presented here have been worked out 
within the frame of the actual-genetic 
(act) theory of perception and person- 
ality, interpretations of the findings do 
not necessarily presuppose the accept- 
ance of the model as a whole. 

During the last decade the concept 
of “perceptual defense” has been the 
object of continuous discussion. Even 
if there is affinity of approach with re- 
spect to techniques there are, however, 
great differences between the analysis 
of pre-cognitive organization given 
here, and the analysis of “perceptual 
defense” mainly aiming at establishing 
the frequency of raised thresholds for 
correct recognition. 


METHOD AND DEFINITIONS 


The technique consists in the succes- 
sive exposure of a TAT-like picture by 
the tachistoscope, exposure time being 
progressively increased. 

The tachistoscope is made of a projector 
with a 100 Watt bulb and a camera shutter. 
A piece of paper is placed around the projec- 
tor tube so as to screen off the experimenter’s 
manipulations from the Ss. The pictures are 
projected onto a transparent screen 25”x14”. 
The screen is placed between the projector 
and the Ss at the distance where the projected 
(and focused) picture just covers the screen. 
No fixation point is utilized. The illumina- 
tion of the room is regulated so that the pic- 
ture is barely recognized clearly and distinct- 
ly at unlimited exposure time. Under the 
standard conditions and with the standard dia- 
positives employed this equals scale values 
.4-2.0 on a Weston Master Universal Exposure 
Meter placed at the distance of 5 m. in front 
of the screen, and directed towards the projec- 
tor. 


There are 12 exposures of each picture, in 
msecs: 20, 40, 40, 100, 100, 200, 200, 200, 500, 
500, 500, 500. 

The stimulus employed here is the TAT 
picture number | (boy with a violin), the head 
and shoulders of a threatening and ugly male 
person having been inserted at the right above 
the boy. The experiments are conducted with 
groups of not more than about 40 Ss. The first 
row of Ss is placed at the distance of about 
6 m. from the screen. (Mentally abnormal Ss 
are tested individually.) 


The Ss are instructed to make a drawing of 
what they have seen on the screen without 
paying attention to whether their impression 
is correct or not. For each exposure they have 
to make a drawing in a square. If they feel 
unable to make any kind of drawing, they are 
also allowed to make markings instead of a 
drawing. In addition to the drawings (mark- 
ings) a written comment is also required so as 
to make quite clear what each S has seen, and 
at what place on the screen. The Ss are told to 
work as fast as they can without neglecting the 
thoroughness of description. The time interval 
between the exposures is determined by the 
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time required by the slowest S for reporting. 
It has rarely been more than about 3 minutes. 

In all 260 aviation cadets took part in the 
experiments; their age varies between 18 and 
21 years. They were all subjected to a pre- 
liminary screening procedure, medical and 
psychological. None has an I.Q. much below 
the average, and visual acuity is normal. Their 
educational level varies between elementary 
school and college (Swedish Gymnasium). 

Definitions. Actualgenesis denotes 
method as well as (“pre-cognitive”’) 
data in their serial arrangement. The 
concept phase is defined as the sum 
total of data registered in connection 
with one exposure. In the experiments 
referred to here, the first phase corre- 
sponds to the exposure at which S has 
first seen something to which he attri- 
butes a meaning, and the last phase 
that exposure at which he has seen the 
picture “‘correctly”. P-phases we term 
all those phases which precede the last 
phase. 


DEFENSIVE ORGANIZATION AND 
DEFENSE MECHANISMS 


The analysis of certain well defina- 
ble types of pre-cognitive organization 
will be related to the frame of psycho- 
analytic defense mechanisms for the 
following reasons: 

1. Specific types of pre-cognitive or- 
ganization have been found to charac- 
terize compulsion neurotic patients 
and have been associated with the de- 
fense mechanism of isolation (6 pp. 47, 
191-204 and 288; 14). 

2. In a small group of patients suf- 
fering from anxiety hysteria, the fol- 
lowing phenomena were observed. 
Exposing the TAT picture nr 6GF by 
the tachistoscope in the manner des- 
cribed above, an anxiety-laden black 
spot gradually emerged at the place of 
the man in the picture. But at the 
very exposure where the man’s head 
was on the verge of becoming recog- 
nized by the (female) patients, they 
perceived instead an ugly, threatening 
devil’s mask. For three reasons this 
particular pre-cognitive structure was 
tentatively identified by referring to 
the defense mechanism of repression. 
Since S. Freud (4, pp. 63-65 and 155- 





158), repression is considered to be 
the main mechanism operating in hys- 
teria. The threat presented by the pic- 
ture was a sexual’ ’ one, and the trans- 
formation of the visual field occurred 
at that perceptual level where the 
heterosexual relation was barely be- 
coming recognized by the Ss (cf. 1, pp. 
148-150). Finally the structural char- 
acteristics of “rigidity” (possibly con- 
nected with hysterical paralysis) also 
made it plausible to refer to the mech- 
anism of repression (9). 

3. Granted that no sharp border- 
lines separate the “normal” personal- 
ity from the “abnormal” one as re- 
gards the working of defense mech- 
anism (the occurrence of “defensive” 
pre-cognitive organization), analogous 
mechanisms (types of pre-cognitive or- 
ganization) should be found in nor- 
mal Ss with “abnormally strong” 
stimuli as in abnormal Ss with “weak” 
or “moderately strong” stimuli. If a 
threat is very strong, it seems reason- 
able to assume that even ordinarily 
rather well-adapted Ss are likely to 
take refuge in defensive measures (cf. 
10 p. 582). Starting with the experi- 
ences with compulsion neurotic and 
hysteric patients mentioned, the nor- 
mal secondary person in the (TAT) 
picture was thus replaced by an ab- 
normally threatening one in order to 
evoke analogous abnormal (defensive) 
responses in normal] Ss. 

Personality descriptions and observations of 
behavior of aviation cadets who failed in pri- 
mary training were made by officers of the 
training staff. These descriptions, too, lend 
support to an identification of defensive or- 
ganization with defense mechanisms. They 
have not, however, been formalized for a 
rating procedure. 


4. We have already pointed at the 
close correspondence between Freud's 
scheme and the scheme of our experi- 
ments. In many cases we find manifest 
anxiety reported by the S at one or 
more exposures, the anxiety referring 
to a black or dark part of the visual 
field; at the next exposure this organi- 
zation is replaced by another less stim- 
ulus-conformable one, the anxiety (dis- 
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pleasure) disappearing more or less 
completely. 

Utilizing Freud’s terminology we say 
that in our experiments, secondary 
anxiety is induced, “in which some- 
thing occurs in the id which activates 
a danger-situation for the ego and in- 
duces the latter to give the anxiety sig- 
nal for inhibitions to take place” (4 p. 
113) . Following Freud we assume that 
the reaction to an internal danger-situ- 
ation (which is created in our experi- 
ments by the short exposures of threat- 
ening pictures) is analogous to the re- 
action to an external danger-situation. 

In the experiments neither the 
threat nor the anxiety must necessarily 
become conscious in order to provoke 
a defensive reaction. This is effected by 
using the projection of the ego instead 
of inflicting pain or displeasure direct- 
ly on the Ss’ ego (as in many experi- 
ments on perceptual defense) . On the 
other hand there are Ss in whom the 
anxiety is manifest but defensive reac- 
tions are minimal, resulting in reality- 
testing. 

5. There are obvious structural cor- 
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respondences between types of pre-cog- 
nitive organization in our experiments 
on one hand and types of pre-conscious 
organization, i.e. defense mechanisms, 
on the other (1, 3, 4). When, for ex- 
ample, the existence of a threat is ex- 
plicitly denied by the S at one ex- 
posure, we may refer to the defense 
mechanism of denial. If the threat is 
seen to be directed from the hero 
toward the secondary person, we may 
refer to identification with the ag- 
ressor. 

6. The technique of free association, 
which has been the paramount tool of 
psychoanalysis, involves the recon- 
struction of pre-conscious material, 
neatly corresponding with the recon- 
struction of pre-cognitive levels (or of 
P-phases) in the act experiment. The 
concept of defense mechanism was 
mainly derived from data obtained by 
means of the. method of free associa- 
tion. 

We cannot, therefore, agree with G. 
Klein (5, pp. 349-350) who maintains 
that it is “pointless to expect direct 
analogues of ‘repression’ in percep- 


Norte. Markings made by the Ss: pe=person, m=man, k—woman, g—boy, f=girl, +—sym- 
pathetic, ——unsympathetic. Numbers represent age. 





Fic. 1 — Repression (4) and 
isolation (2) 
“A sleeping boy with a violin on his knees. 
A flower pot on the table.” 
(S nr 1, exposure 9) 


Fic. 2—Repression (2) and 
reaction-formation 


“A boy playing with a puppy.” 
(S nr 2, exposure 3) 
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Fic. 3—Isolation (1 and 4) 
“An odd old fellow to the right.” 
(S nr 5, exposure 8) 


Fic. 5—Isolation (double 1) 
and repression (4) 


“A flower in a window.” 
(S nr 5, exposure 10) 


Pre-Cognitive Defensive Organization in a Tachistoscopic Experiment 


Fic. 4—Isolation (double 1) 


‘To the right the boy was seen in a mirror.” 
(S nr 5, exposure 9) 


Fic. 6—Reaction-formation 


“A girl is sitting in the other bank, writing.” 
(S nr 6, exposure 12) 


tion” due to “the differences of meth-the activity of defense mechanisms 


od which both concepts reflect: per- 
ceptual methods are radically unlike 
those which gave rise to the classical 
defense mechanisms”. 

7. In neuroses one of the main char- 
acteristics of the defense mechanisms is 
considered to be that of distorting and 
counteracting reality-testing. Corre- 
spondingly we assume that in the de- 
velopment of percepts in the direction 
of reality (or stimulus) , it is a sign of 


when the threat (on the TAT-like pic- 
ture) is not adequately dealt with in 
terms of reality, in particular at long 
exposure times (in late phases, on late 
perceptual levels), and when a pre- 
ceding relatively correct organization 
is replaced by a less correct one. 


CLASSIFICATION OF PRE-COGNITIVE 
DEFENSIVE ORGANIZATION 


The following classification of pre- 





Fic. 7—Identification with the 
aggressor 


“A soldier during fight. Critical situation,” 
(S nr 7, exposure 4) 


Fic. 9—No defensive type of 
organization 
“‘An unsympathetic man appeared. 
Threatens the woman with a pistol.” 
(S nr 8, exposure 12) 


cognitive defensive organization has 
been undertaken on the basis of draw- 
ings and reports of 260 aviation cadets 
(mainly those who failed in primary 
training) . 

Hero denotes the person who is seen 
by the S at the place of the main per- 
son in the picture. Secondary denotes 


Fic. 10—Turning against the self (1) 
“Maybe patient on the operating-table 
during stomach operation. A male 
assistant, and a female doctor?” 
(S nr 8, exposure 13) 


Fic. 8—Repression (5) 


“A woman sitting at table, being served.” 
(S nr 8, exposure 8) 


na 


the person who is seen at the place of 
the secondary person in the picture. At 
the beginning of many series only one 
person is seen; in such cases we also 
term this person hero on the condition 
that he is not situated at the place of 
the secondary person in the picture. 
The figures illustrate some types of 
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Fic. 11—Isolation (5) 


“A woman at her lace-making table.” 
(S nr 8, exposure 15) 


defensive organization. They are draw- 
ings made by seven of the Ss. It should 
be underlined that all drawings refer 
to one single picture motif. 

Repression 

The hero or the secondary are living 
but not human beings, or lifeless ob- 
jects. 

(1) The hero, or the secondary, or 
both are rigid, inanimate beings. 
Examples: sculpture, bust, statue. 

(2) The hero or the secondary is an 
animal. (Fig. 2) 

(3) The secondary is a “mask”, or 
“disguised”. 

Mixed type (repression and isola- 
tion, 1): “a man disguised as a 
host”. 

(4) The hero or the secondary is an 
object. (Fig. 5) 

Mixed type (repression and isola- 
tion, 2) : lamp, window. 

(5) The hero or the secondary is only 
a part of the body (except for the 
face). (Fig. 8) 


Isolation 


The hero and the secondary are 
separated or isolated from each other. 
The main person, or the secondary 
person, or the threat are not seen. The 
pre-cognitive levels are excluded (iso- 


Fic. 12—Isolation (5) and 
Turning against the self (1) 


“Somebody lying on the floor in the 
foreground.” (S nr 9, exposure 13) 


lated) and there is only one single 

uasi-objective level of cognition. 

4h) The hero and the secondary are 
isolated from each other in the 
visual field. (Figs. 3, 4 and 5) 
Examples: (a) The hero is sepa- 
rated from the secondary bya 
line, a dark surface, or an object. 
(b) The hero turns away from the 
secondary or vice versa. The hero 
uts his fingers into his ears or 
olds his hands before his eyes. 
This type of organization may be 
looked upon as a mixture of iso- 
lation and denial. (c) The per- 
sons are referred to different 
frames; they belong to different 
levels of reality or the secondary 
is a reflected image of the hero or 
they “do not have any connection 
with each other” or one of them is 
in perspective in space, the other 
not. 
Mixed type (isolation and repres- 
sion) : the secondary, but not the 
hero, is a picture; (isolation and 
denial): “they are not conscious 
of each other’s presence”’. 

(2) The secondary is a white spot. 
(Fig. 11) 
Mixed type (isolation and repres- 
sion): lighted lamp, bright win- 
dow. 
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(3) There is no secondary at all 
through the whole series, not even 
in the form of a threatening dark 
part of the field. 

The secondary person is not seen, 
though the main person has ap- 

eared more or less in detail dur- 
ing many previous exposures. 

(4) There is a secondary who is, how- 
ever, neither threatening nor even 
unsympathetic, (Fig. 3) These 
structures represent mixed types 
of isolation on one hand and 
of denial or reaction formation on 
the other hand. 

(5) After one or many persons have 
been seen at one or many previous 
exposures, the persons disappear 
at one or many subsequent ex- 
posures: phase _ discontinuity. 
(Fig. 11) 

Phase discontinuity at one ex- 
posure and in one series only may 
be due to blinking. 

Discontinuity may affect (a) the 
whole visual field (zero phase) , 
(b) hero only (hero discontinu- 
ity), or (c) only the secondary 
(secondary discontinuity), or it 
may affect a dark, threatening spot 
at the secondary person’s place 
(threat discontinuity) . 

Mixed type (isolation, 1 and 5): 
the whole picture is seen as “un- 
real”, “like a dream or a phan- 
tasy”’. 

(6) There are no changes of contents 

all through the series. 
The contents may be essentially 
correct from the first exposure and 
afterward with respect to sche- 
matic characteristics (the hero’s 
age and sex, etc.) . 


Denial 


The threat is explicity denied in the 
S’s report. 

Mixed type (denial and isolation) : 
The secondary is “a threat which has 
not been drawn correctly,” or the 
whole picture is said to represent “a 
puppet show.” In these cases the 
threat is partly denied, and partly iso- 
lated as incorrect. 
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Reaction formation (Fig. 6) 

The threat is turned into its oppo- 
site. 

Examples: The secondary is “nice”, 
“protective” or “friendly” to the hero. 

In some cases it would be difficult to 
make any sharp distinction between 
denial and reaction formation, e.g., 
when the hero smiles or when the 
emotional atmosphere is experienced 
as “pleasant”. 


Identification with the aggressor 
(Fig. 7) 

The role of the main person is re- 
versed, the hero becoming the aggres- 
sor. Mixed type (identification with 
the aggressor and reaction formation): 
the hero protects the secondary. 


Turning against the self 
(Figs. 10 and 12) 

The hero or the hero’s instrument is 
damaged or worthless, or the hero’s in- 
strument is a threat to the hero. 

(1) The hero or his instrument (cor- 
responding with, e.g., the violin in 
the picture TAT nr 1) is a sick, 
bleeding, crying, or dead person, 
or a person committing suicide. 

(2) The hero’s instrument is some- 
thing damaged or broken: a 
ragged toy, a broken tool. In some 
cases the whole picture is seen as 
“an explosion,” “the Guernica.” 

(3) The hero’s instrument is some- 
thing dark and threatening. 

Defensive organization which is not 
directly identifiable as a classical de- 
fense mechanism: Identification with 
a female role. 

(1) The hero is a female person and 
the hero’s instrument or tool a 
rounded or/and hollow object 
(ball, marble, etc.) . 

(2) The hero is a woman or a girl, 
and the hero’s instrument a baby- 
doll in the woman’s (girl’s) arms. 


DIscussION 


It has proved possible to identify, in 
a group of aviation cadets, types of pre- 
cognitive defensive organization previ- 
ously found in compulsion neurotic 
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and (anxiety) hysteric patients, and 
also to associate preliminarily other 
types of pre-cognitive organization 
with the defense mechanisms of denial, 
reaction formation, identification with 
the aggressor, and turning against the 
self. Results obtained from predictions 
of failure and success in primary train- 
ing of aviation cadets on the basis of 
ratings of defensive pre-cognitive or- 
ganization, seem to substantiate the 
assumption that such organization 
may be referred to the classical de- 
fense mechanisms. 

It should be stressed that this analy- 
sis of pre-cognitive defensive organiza- 
tion is a preliminary one. Much re- 
search work with groups of abnormal 
Ss, and with single cases diagnosed by 
psychoanalysts, will be required in or- 
der to state more exactly which type of 
pre-cognitive organization to refer to 
what specific defense mechanism. 

The operational definitions of de- 
fensive organization in this article 
refer to one single exposure (phase) 
only, or to the sequence of two expo- 
sures (phases). We have refrained 
from discussing the questions concern- 
ing the strength of the (single) defen- 
sive organization, the significance of 
the pre-cognitive level, the various 
combinations of pre-cognitive organi- 
zation, and the types of sequence. We 
are working through these areas and 
at the same time trying to establish 
rating procedures for personnel selec- 
tion and for clinical diagnosis (the 
PDO-test, 8) . It is supposed that diag- 
nosis of neuroses via functions (in our 
experiments — pre-cognitive defensive 
organization) should be preferred to 
that via symptoms (1 p. 194) . 


SUMMARY 


In experiments with tachistoscopic 
exposure of TAT-like pictures, expo- 
sure time being repeatedly increased 
until nearly correct recognition, a 
threat to the hero was introduced. 
Types of defensive (visual) organiza- 
tion below the threshold of correct rec- 
ognition are preliminarily classified 
and referred to the defense mechan- 





isms of repression, isolation, denial, re- 
action formation, identification with 
the aggressor, and turning against the 
self. Other types of responses, too, are 
marked off as supposedly defensive. A 
number of facts and arguments for 
associating pre-cognitive defensive 
structures with the classical defense 
mechanisms are discussed. 
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Projective Test Assessment of the Effects of Sleep Deprivation’ 
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Although many investigators (3, 5, 
16) report “alterations in personality” 
among the most important manifest ef- 
fects of prolonged sleep loss, findings 
from specific personality test measure- 
ments are lacking.? The so-called “per- 
sonality changes” attributed to acute 
sleep loss usually refer to three fea- 
tures which are described as nearly 
universally experienced: _hallucina- 
tory experiences, irritability, and 
subjectively felt changes. 

Pilot studies of prolonged sleep loss 
by the present investigators and their 
colleagues revealed the need for more 
precise operational definitions of such 
changes and helped us reach the fol- 
lowing conclusions: 


1. Definition of the stress stimulus 
(e.g., a certain number of hours of 
sleep deprivation) does not constitute 
an adequate operational definition of 
the stress, since it is variably experi- 
enced, reacted to, and communicated 
by different subjects. 


2. Reactions attributed to sleep loss 
have not been adequately screened 


* We are indebted to Major Harold Williams, 
Chief of Clinical and Social Psychology, and 
Dr. Ardie Lubin, Research Psychologist, both 
of WRAIR, Forest Glen, Maryland, for their 
advice on the statistical as well as many other 
aspects of this study; to Dr. David McK. 
Rioch, Director of the Division of Neuro- 
preerr. WRAIR; to Miss Elizabeth 

roomhead, Chief Clinical Psychologist, Na- 
tional Naval Medical Center, Bethesda, Mary- 
land, and to Dr. Marion Handlon, Staff Psy- 
chologist, Chestnut Lodge Sanatarium, Rock- 
ville, Maryland, for their blind analyses of 
our Rorschach records; to Mr. Jim Kincan- 
non for his analysis of our Iflund results, and 
Miss Betty Shanks of WRAIR for her Devos 
scoring of our Rorschach records. 

* Tyler (15) reported that group Rorschach 
examinations given before and after 100 
hours of wakefulness showed no significant 
changes. 


from reactions to the experimental set- 
ting which also is variably experienced. 
For example, to some subjects, the 
Walter Reed experimental setting was 
a bonanza of free snacks and attention 
for which sleep loss was a small price 
to pay; for some subjects it was a frus- 
tratingly restrictive interlude, where 
deprivation of privacy, freedom of 
movement, decent clothing and wom- 
en, was a heavy addition to the load 
of sleep deprivation; for others it was 
primarily a challenging, competitively 
stimulating experience, and for still 
others the setting was relatively unim- 
portant. 

Various methods of handling the 
subjects is another aspect of the ex- 
perimental setting which may affect 
the kind of “personality changes” 
found. For example, in the Walter 
Reed project where the subjects were 
handled gently and permissively, in- 
creased irritability was not clinically 
manifest. Similarly, the incidence and 
kind of hallucinatory experiences may 
be affected. Myths about what happens 
to subjects during prolonged loss are 
transmitted in the literature and may 
affect the results through both staff and 
subjects. Further, the interview meth- 
ods used may channel what is commu- 
nicated so that stereotyped interactions 
are set up which elicit biased answers. 
For example, what one examiner asks 
may influence a sleep-deprived subject 
to reply in ways that will class his ex- 
perience as a hallucination whereas 
another questioner in another setting 
may elicit mere sensory disturbance re- 
ports from him. Similarly, the ques- 
tioner may inadvertantly affect the in- 
cidence of hallucinations reported in 
the different sensoria. 


3. Precise definitions of the non- 
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verbal, behaviorally non-overt, subjec- 
tively felt changes are especially ham- 

red by linguistic limitations and 
here, just the process of describing to 
another person may alter the felt state. 

4, Testing, as with describing sub- 
jectively felt changes, may alter the 
state being tested. Those who have 
worked with ee i subjected to pro- 
longed sleep loss have been impressed, 
as we were, with the subject's rs med 
to pull himself together, alerted by the 
stimuli of test and tester, and to re- 
spond more or less as he did before 
sleep deprivation, no matter how he 
appeared or behaved between tests. 

5. Finally, we are not sure precisely 
what personality tests measure, nor is it 
known whether or how they would be 
sensitive to short-term alterations of 
personality functioning. 


This exploratory paper deals with 
two major types of questions: 

1. What, if any, changes do projec- 
tive tests show which are attributable 
to sleep loss? 

2. From the projective test perform- 
ance before sleep deprivation, can 
sleep deprived behavior be predicted? 
Of the many psychological and physi- 
ological aspects of sleep deprived be- 
havior which might profitably be in- 
vestigated in this way, for only the fol- 
lowing two were criteria available: 

a. How well will the sleep deprived 
subject maintain his mental and psy- 
chomotor efficiency? 

b. Which sleep deprived subjects 
will hallucinate? 


THE RESEARCH SETTING 


In the 1957 WRAIR sleep depriva- 
tion project, a research group com- 
posed of neurophysiologists, psychia- 
trists and psychologists pak the ef- 
fects of 98 hours of sleep deprivation 
on 25 enlisted men from an Army field 
hospital training unit who volunteered 
for the experiment. 

The subjects reported in groups of 
ten for a twelve-day period. Each of the 
five groups was divided into five ex- 





perimental and five control subjects 
roughly matched for intelligence. After 
an orientation day, there were three 
control days. From the beginning of 
the fifth day, the experimental group 
was kept awake for 98 hours. Following 
a one-day break during which the ex- 
perimental group were sleeping, there 
were three more control or recovery 
days. 

The 25 experimental and 25 control 
subjects were confined to a suite of 
rooms from the first orientation day 
through the last recovery day. Both 
groups were in hospital garb. The 
suite of rooms was equipped with ma- 
terials for handicrafts and games, with 
television and radio sets, books and 
magazines and a refrigerator for 
snacks. 

A nurse, several medical corpsmen, 
and members of the staff kept the men 
under close observation, especially dur- 
ing the sleep-deprivation _ period. 
Those persons and the subjects them- 
selves helped those of the men who 
needed it to stay awake during their 
sleepiest phases by walking, playing 
games or talking with them. The atti- 
tudes of the subjects toward the ex- 
periment and the staff were excellent 
for the most part, with no one drop- 
ping out of the experiment before its 
completion. 


EXPERIMENT DESIGN 


Four projective tests were adminis- 
tered to the fifty subjects on three oc- 
casions spaced as follows, so that each 
subject’s sleep deprived performance 
could be evaluated in the light of (a) 
his own performance before sleep loss 
and after his “recovery” sleep, and (b) 
the performance of roughly matched 
control subjects given the same testing 
schedule but allowed to sleep eight 
hours each day: 

1. Baseline test (on the third control 
day) . 

2. Sleep deprivation test (on the last 
experimental day after some 80 hours 
of wakefulness on the part of the ex- 
perimental subjects) . 
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3. Recovery test (on the third recov- 
ery day). 

Exactly the same form of the four 
tests were readministered in the same 
order in the same testing room, and for 
the most part, by the same author. The 
three pep ey ey of a given subject 
in each of the four tests were scored at 
the same time to promote consistency 
in scoring. 

The four projective tests used’ were: 

1, The Rosenzweig test (14) , scored 
according to the directions in the man- 
ual. 

2. The Iflund test (6), analyzed for 
shifts in order of recall. 

3. The Glad test (4) , scored for “in- 
terpersonal” qualities in verbalization 
according to the directions in Leary’s 
manual (8). The scoring was done 
blind, and the order in scoring the 
three performances of each man was 
randomized as was the order in scoring 
the experimental and control Ss. 

4, The Rorschach test (13) , analyzed 
on the basis of 


a. Scores: 


(1) Thirty-eight different standard 
scores,* ratios, and percentages were 





* The Rorschach test, in which the authors 
were most interested and experienced, was 
given more exhaustive treatment than the 
other three projective tests. Even the experi- 
mental design was more appropriate for the 
Rorschach than for the other tests. For exam- 
ple, the Iflund test as used here could not be 
submitted to an expert for blind differentia- 
tion of experimental test from experimental 
retest, or experimental test and retest from 
control test and retest, on the basis of sleep 
deprivation effects, because one could tell at 
a glance which was which from the number 
of cards recalled at first showing. 


*The 38 separate standard Rorschach scores 
analyzed were t,, tnon-c’, aV-t, #R (number of 
responses) , number of rejections, M. FM, m, 
FC, CF + C, c (all shading) , #P, P%, W%, 
Dd%, S% (all space responses), F%, F + 
%, extended F + %, H%, A%, Anat.%, % 
other stereotype, EBY (ratio of M + Sum C 
to R), eb% (ratio of FM + m+4ctoR),M% 
(ratio of M to M + Sum C), M’% (ratio of 
M to M + FM), M”% (ratio of M to M + 
FM + m), FM’% (FM + mtoFM + mo), 
C% (Sum C to Sum C + Sum c), %R to last 
3 cards, W’% (ratio of A+ HtoA +H + 


calculated. Having first agreed on a 
common scoring system and spot- 
checked for reliability of scoring, each 
author scored the records of the sub- 
jects she had tested. 

(2) The affective tone of the con- 
tent was quantified by the Devos 
method (2). 


b. Clinical interpretations: 


(1) By the authors: 

(a) A-posteriori interpretive analy- 
sis of differences between sleep de- 
prived and non-sleep deprived per- 
formance. 

(b) Prediction of which sleep de- 
prived subjects would hallucinate. 

(c) Subsequent a posteriori qualita- 
tive description™ of differences be- 
tween the baseline Rorschach perform- 
ance of the subjects independently 
rated as “hallucinators” and “non- 
hallucinators.” 


(2) By two consultant Rorschach 
experts:! (whose rey gen were 
“blind” in that they had no contact 
with the experiment other than the 
Rorschach protocols, but were given 
the description of the research setting, 
so to that extent were working in con- 
text) . 

(a) Selection of the sleep deprived 
protocol from pairs of records of the 
experimental subjects. One consultant 
was given 12, the other 13, randomly 
selected protocols of the 25 experimen- 
tal subjects in pairs and told that one 
of each pair was given before, and the 
other after 80 hours of sleep depriva- 
tion. Obvious retest clues such as “I 
remember this from last time,” and re- 
action time were deleted. 

(b) Selection of the experimental 
pairs from the control pairs. Each con- 
sultant was given the experimental 
pairs she had not seen before plus an 
equal number of control pairs, was 
told these would be presented to her in 
random order. Nothing but obvious 
sleep deprived remarks were deleted 





Ad + Hd), A + H — Ad — Hd, M + Sum C, 
M — Sum C, FM + m + c, FM + m, —c. 
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since the order of administration with- 
in each pair was given. 

i) Prediction from the subject’s 
baseline record of how well he would 
stand up under the stress of prolonged 
sleep deprivation in terms of a rating 
scale> hereinafter referred to as the 
h-rating. 

(d) Prediction of which subjects 
would hallucinate when sleep de- 

rived. Here the consultants per- 
ormed two operations: 


1. First they were asked to predict 
whether or not the subject would hal- 
lucinate under the stress of sleep de- 
privation and to state the reasons for 
their predictions. No other directions 
were given. 

2. Two months later each was again 
given the baseline Rorschach protocols 
of the 16 men for whom independent 
hallucination ratings were available 
(see Hallucination Criterion below) 
along with both the a-posteriori de- 
scriptive summary® of the differences 
between the baseline Rorschach rec- 
ords of the subjects classified as hallu- 
cinators and those not so classified, and 
a checklist? derived therefrom, then 
asked to: 

(a) Rate these 16 subjects on each of 
the seven items on the checklist. 

(b) Label each subject diagnostical- 
ly. In addition. they were asked to state 
whether “hysteric traits” or “obsessive 
traits” were more prominent, regard- 
less of the diagnosis. 

(c) Predict which subjects would 
hallucinate after prolonged sleep loss, 
basing their judgment on the authors’ 
descriptive summary. They were given 
no information as to the success of 
their prior predictions. 





5 Place each subject on a six point scale as to 
how well he will probably stand up under the 
stress of prolonged sleep deprivation, rating 
as “6” the subjects you think most likely to 
hold their own or improve on their pre-sleep 
deprived efficiency level in mental and psy- 
chomotor tasks, and as “1” the subjects you 
think most likely to fold under stress. 


* Cited in footnote 14. 


7 Cited in footnote 15. 





CRITERIA 


1. Hold (H) Criterion: The follow- 
ing six tests were used to assess how 
well the subjects maintained their own 
mental and psychomotor efficiency 
level on retest because they showed 
significant mean decrement with sleep 
loss, and were administered at about 
the same times as the projective tests to 
a large enough number of the men so 
that each of the 25 experimental and 
25 control subjects had been given at 
least three of the six tests. 

a. An auditory vigilance task requir- 
ing the subject to press a lever when 
and only when a critical signal oc- 
curred (17). 

b. A reaction time task requiring the 
subject to respond as calle as possi- 
ble to a visual signal (17). 

c. A concept attainment task requir- 
ing the subject to determine the prop- 
erties that defined class membership 
(17). 

d. An adding task requiring the sub- 
ject to add pairs of single digit random 
numbers as fast and as accurately as 
possible (9). 

e. An immediate recall task requir- 
ing the person to recall all the Iflund 
cards after as few re-showings of the 
cards as possible (9) . 

f. A rhythm task requiring the sub- 
ject to press a button at various rates 
ranging from once per second to once 
every eight seconds (10) . 

Sleep deprivation did not have a 
general effect on any given subject in 
all these tests; it was not uncommon to 
find a sleep-deprived subject who im- 

roved as well as the best control sub- 

ject on one of these tests, yet deterio- 
rated drastically on another. Neverthe- 
less, it was possible to rate the men on a 
six-point scale, hereinafter referred to 
as the H-criterion, ranging from those 
who held their own or improved in 
most of these tests (rated as “6’’), to 
those who fell down on most of them 
gem as “1’’”) . Independent rating of 
the men in this respect by one of our 
statistical consultants compared re- 
liably with the authors’ ratings. 
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2. Hallucination Criterion: From 
tape recording of sleep-deprived Psy: 
chiatric interviews plus the clinical o 
servations of staff members, 16 of our 
25 experimental subjects were rated by 
a staff psychiatrist and psychologist 
(11) on a six-point hallucination rat- 
ing scale. This ranged from “no re- 
ported changes” through “illusory 
changes” to reported experiences 
where the subject labelled or inter- 
preted the illusory change and was 
enough confused about, or, for a time 
at least, convinced of the reality of the 
experience to act upon it (e.g., not just 
that spots on the linoleum floor 
seemed to the subject to move, but that 
he thought they might be bugs and 
tried to stamp on and squash them) . 
Only the latter subjects were classed as 
“hallucinators.” 


RESULTS 


I. Sleep loss effects on projective test 
performance. 


Our over-all conclusion from the re- 
sults of repeated projective tests® is 
that prolonged sleep loss leaves person- 
ality structure essentially unchanged, 
but does modify how, and how much 
of himself, the subject communicates. 
The sleep deprived men changed sig- 
nificantly more than did the controls 
on retest, but like the controls, how 
they changed was a complex function 
of the interaction of their long term 
personality adjustment with the whole 
experimental setup, and not of the 
sleep loss alone. 


Findings which led to these conclu- 
sions: 


a. From statistical analysis of scores:® 





*It may be that the first experience with a 
projective test sets up a pattern of response 
so that subsequent use of that test material is 
colored by this initial experience and remem- 
brance. An alternative explanation of the 
paucity of change found here, and the one we 
favor, is that these tests measure aspects of 
personality structure, which is fairly stable. 

* Since we had no evidence on which to base 
a priori hypotheses about score changes to 
be expected with sl deprivation, only 
two-tailed tests of significance were used. 


(1) Only three of the more than a 
hundred different test scores showed 
changes in central tendency significant 
at the .05 level of confidence or bet- 
ter:1° 


(a) The Rorschach MY (ratio of M 
to M + Sum C) increased with sleep 
deprivation (.01 level of confidence) . 

(b) The sado-masochistic Iflund 
card was recalled earlier after pro- 
longed sleep loss. (.01 level of confi- 
dence.) 

(c) The Devos Rorschach N% (% 
of responses neutral in affective tone) 
increased on the second, sleep deprived 

eriod testing to about the same extent 
or both the experimental and control 
subjects but the experimental subjects 
increased their NY% even more on the 
third, recovery period performance 
whereas the control subjects returned 
to their baseline levels.1! (.02 level of 
confidence.) ~ 


Since three out of 100 changes might 
easily appear significant by chance, any 
or all three of these may be spurious 
results. Further, the clinicians’ analy- 
ses did not support the hypothesis that 
these findings were attributable to 
sleep loss. 

(2) The Rorschach test showed no 


cumulative over-all effect of sleep de- 
privation on scores.12 


(3) Only two of the more than a 





2° Kendall’s W was used to test for trends from 
baseline through sleep deprivation to the 
recovery period. 

11 Tyler’s finding (16) that “tests on the morn- 
ing after recovery sleep showed no changes 
from pre-experimental control values other 
than those due to learning” was not the rule 
in our data. Further, clinical evidence of 
delayed reactions to sleep loss and of con- 
siderable individual variation in rate of re- 
covery, reported by our staff observers, sup- 
ports our impression that the recovery 
period performance cannot be assumed to be 
a “recovered” ormance, thus cannot be 
used as a second sample of baseline per- 
formance. 

12 The nature of the tests, and our lack of a 
priori hypotheses as to the direction, if any, 
toward which sleep deprivation would move 


the scores, preclude an adequate assessment 
of the possibility that prolonged sleep loss 
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hundred scores showed changes from 
baseline to sleep deprived perform- 
ance that were differentially related to 
their respective baseline scores: 

(a) Subjects who gave less than 20 
baseline responses (#R) tended to 
give even fewer responses after pro- 
longed sleep loss (chi-square test, .02 
level of confidence) . 

(b) Subjects who gave less than six 
popular responses (Beck P) (1) tend- 
ed to give a decreased #P after pro- 
longed sleep loss (chi-square test, .01 
level of confidence) . 

Although no such test-retest rela- 
tionships proved significant for the 
control subjects, again two out of 100 
scores might easily show a significantly 
curvilinear test-retest relationship by 
chance. Furthermore, in most cases the 
change was very small, often amount- 
ing to a difference of only one R or P, 
so that, for a sample the size of ours, 
replication would be especially neces- 
sary. In the case of the #R, replication 
with the same results would still not 
constitute evidence that the change is 
a function of sleep deprivation, since it 
has been our experience that the retest 
#R tends normally to be curvilinearly 
related to the original #R, i.e., that 
subjects with low R tend to decrease 
their number of responses on retest 
whereas those with relatively high 
number of responses are more apt to 
hold their own or increase, contrary to 
the more usual homeostatic tenden- 
cies. The fact that this was found to a 
significant degree for the sleep de- 
prived subjects and not for the con- 

trol subjects suggests that the sleep 
deprivation effect, if any, was an ac- 





might have an over-all cumulative effect on 
central tendency even though so few of the 
separate scores changed significantly. In the 
case of the Rorschach test, however, we haz- 
arded a guess that sleep loss would, if any- 
thing, increase the mean of the reaction 
time, the A%, the S%, other stereotypy, and 
the number of rejections, and lower the 
mean #R, the F + %, the PY, the C%, the 
M’%, and the W’%. Accordingly all these 
variables were added to form a single sum 
score, the Kendall W of which was not sig- 
nificant. 


centuation of an already existing rela- 
tionship. 


(4) The experimental subjects 
showed a greater absolute amount of 
test-retest change than did the control 
subjects, not only, and not even especi- 
ally, on test scores whose central ten- 
dencies changed most. 

(a) For every one of the 38 Ror- 
schach standard scores, the sum of the 
absolute amount of change was greater 
for the experimental than for the con- 
tre] subjects. 

(b) For only four scores, however, 
was the absolute change significant 

(Mann-Whitney U test) : 

1.H% (.001 level of confidence) 

2. Average reaction time (.01 level of 

confidence) 

3.C% (Sum C to Sum C+Sum c) 

(.01 level of confidence) 

4.Dd% (.01 level of confidence) 

(c) A crude discrimination score, 
computed by summing these four 
scores was 70 per cent successful in dis- 
criminating the experimental from the 
control subjects when the median was 
used as a cutting point, and the opti- 
mal cutting point was 82 per cent cor- 
rect. 


b. Findings from the consultants’ 
analyses: 


(1) Neither consultant had better 
than chance success in discriminating 
between the baseline and sleep de- 
prived performance of the experimen- 
tal subjects. Both reported that the de- 
cisions were difficult, often having to be 
forced, because (a) in a third of the 
cases they found essentially no differ- 
ences between the two records of the 
same man, and (b) when they did 
consider the records enough changed 
to suggest that something other than a 
brief lapse of time occurred in the 
intertest interval, the changes were 
still relatively slight, never basic, and 
seldom simply or consistently in a 
single direction. 

Each consultant reported independ- 
ently that she usually selected the less 
adequate as the sleep deprived per- 
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formance. We know of no established 
criteria for adequacy of adjustment 
but, to the extent that their value 
judgments were valid, their negative 
results suggest that sleep deprivation 
does not make for less adequate Ror- 
schach performance. 

However, the value of these negative 
findings as evidence that sleep depriva- 
tion does not significantly change Ror- 
schach performance is weakened by the 
following methodological artifact: the 
order of administration was not given 
because the consultants knew that the 
retest was always the sleep deprived 
performance. In essence their goal was 
discrimination of test from retest on 
the basis of sleep deprivation effects; in 
practice they sometimes found them- 
selves deciding which of the two rec- 
ords was sleep deprived on the basis of 
retest effects, in spite of conscientious 
efforts to the contrary. 

To the extent that this. occurred, 
consciously or unconsciously, their 
analysis in this operation became an 
unmeasurable mixture of a priori and 
a posteriori judgments about sleep loss 
effects. Thus even had the discrimina- 
tions been highly successful, they might 
have had little to do with sleep depri- 
vation. The varying accuracy of their 
judgments about retest effects further 
complicates interpretation of their 
negative results on this operation. The 
unreliability of retest clues is seldom 
recognized, so deceptively logical do 
they appear on hindsight. For exam- 
ple, in a few of these cases, the authors 
had feared that even after deletion of 
reaction time and obvious retest com- 
ments, the practice effect was so obvi- 
ous it would give the show away, 
whereas the consultants reported re- 
test bias in the opposite direction. 


(2) The consultants were 68 per 
cent'® correct (significant at the .05 





**’ While on the surface it appears as if the 
clinicians were less successful than the sum 
of the scores H%, t, C% and Dd% in their 
discrimination of experimental from con- 
trol subjects, the reverse is probably the 
case since their judgments are a priori 
whereas the scores were the a posteriori dis- 


level of confidence) in differentiating 
the experimental from the control rec- 
ords, given the baseline and retest rec- 
ord of each subject in pairs, with the 
order of administration correctly la- 
belled. Each reported independently 
that she had leaned more heavily on 
the amount than on the kind of 
change, and in fact, most of their errors 
in judgment consisted in labelling as 
“experimental,” control cases where 
they found appreciable change, and 
labelling as “control,” experimental 
cases in which they found relatively 
little change. They reported ——- 
able test-retest differences significantly 
more often in the experimental pairs 
than in the control pairs, (17 experi- 
mental vs. 10 control) . No significant 
generalizations emerged from their de- 
scriptions of what to them constituted 
“appreciable change” but partial clari- 
fication comes from the fact that in 8 
of the 25 experimental pairs and only 
3 of the control pairs they described 
the “appreciable change” largely in 
terms of what they called signs of fa- 
tigue (e.g., compared with his original 
performance the man became slap- 
happy, more careless, showed depletion 
or conservation of energy) . 


c. Authors’ a posteriori analyses: 


(1) More of the experimental than 
control records appeared appreciably 
changed. In one third of the Rorschach 
records of the experimental subjects 
and two thirds of the control subjects, 
essentially no test-retest differences ap- 
peared. 

(2) For the rest, where changes did 
appear, 

(a) In no case was there basic 
change in personality structure. Some 
subjects showed more of themselves 
and some less, some loosened, others 
tightened their controls, but it was the 
self-expression and not the self ex- 
pressed which changed. 





criminations of four scores selected out of 
38, and it is very likely that the score-based 
per cent of correct clasification would be 
decreased on a cross validation sample. 
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(b) The experimental subjects 
changed more than did the control sub- 
jects but no kind of sleep deprived 
change occurred which was not also 
found among the retest changes of con- 
trol subjects. 

(c) No kind of change occurred oft- 
en enough to be considered a character- 
istic sleep deprivation affect. Each sub- 
ject seemed to be changing in his own 
way, which appeared to be a complex 
function of the interaction of his base- 
line personality adjustment with the 
whole experimental set up and not 
a function of sleep loss alone. 

(d) No kind of change occurred sig- 
nificantly more often among the ex- 
perimental than among the control 
group with the following possible ex- 
ception, which at least partially de- 
scribed the change in self-expression or 
communication: while both the ex- 
perimental and control subjects often 
showed the retest practice effect of ver- 
balizing their perceptions in less detail 
but more glibly, in 7 of the experimen- 
tal cases and only 3 of the control cases, 
retest communication seemed either 
more drastically curtailed and con- 
densed than normal, or, if less cur- 
tailed, the added verbiage was more 
redundant than enlightening, reveal- 
ing an “anything goes” attitude. The 
fact that, with one exception, these 
were the same men reported by the 
consultants as “fatigued” indicates re- 
liable spotting of essentially the same 
quality. 

(e) Only one generalization pre- 
dicted the test-retest changes of all the 
subjects, experimental and control, 
better than the hypothesis that there 
would be essentially no change: the 

task-involved subjects who (either be- 
cause of inner standards or “other” 
directed drives to please) marshalled 
their resources at the first challenge of 
the relatively unfamiliar, unstructured 
Rorschach test, either held their own 
or increased in this respect on retest. 
Those who were little interested, chal- 
lenged, or involved originally, tended 
to fall even farther short of their pro- 
ductive capacity on retest. 





Reliability was high between the 
way the authors rated the subjects in 
this respect and the consultants’ h-rat- 
ings, which in turn correlated signifi- 
cantly with how well the man stood up 
on retest in a variety of nonprojective 
tasks (H-criterion) . Further, it will be 
recalled that the #R, which was found 
to be the most important single indi- 
cator of the h-rating by all four clini- 
cians, showed a small but significant 
curvilinear test-retest relationship. 

(f£) Those who changed most in the 
group as a whole, experimental as well 
as control, appeared in general to be 
the more conflict ridden, loosely de- 
fended neurotics (and not, as we had 
anticipated, those with the most schizo- 
phrenic-like records, who were among 
those showing least change). There 
was insufficient agreement among the 
authors and the two consultants to 
substantiate this general impression 
adequately. However, the psychiatrist 
(12) who conducted and analyzed the 
psychiatric interviews of these subjects 
reported that it was the loosely defend- 
ed subjects who showed the greatest 
sleep deprivation effects. Further, he 
too concluded that personality struc- 
ture was essentially unchanged by sleep 
deprivation when he found that the 
factors which differentiated the sleep- 
deprived from the control interviews 
were clearly present in the baseline 
Rorschach personality assessments. 

(g) Reliability among the clinicians 
as to which subjects did and did not 
show essentially any change was high: 

82% agreement among authors and 

consultants 

84% agreement between consultants 

68% agreement between clinicians 

and optimal cutting point of 
4-score sum above described. 


II. Prediction Results. 


Using the Rorschach alone, it was 
possible to predict (a) to what extent a 
man would maintain performance on 
a wide variety of tasks during the slee 
deprivation period, and (b) whi 
men would report hallucinations and 
which would not during the sleep de- 
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privation period. However, the fact 
that the retest efficiency of the control 
subjects was predicted equally success- 
fully by the clinicians raises the ques- 
tion of how much of the predicted ef- 
fects are attributable to sleep loss. 


Findings which led to these conclu- 
sions: 

1. Prediction from the baseline Ror- 
schach performance of how well the 
man would hold up (H-Criterion) : 


a. Only one of the projective test 
scores, the number of responses (#R) , 
differentiated the subjects who stood 
up well under sleep deprivation from 
those who showed decrement in most 
of the six tasks of the H-Criterion sig- 
nificantly better than chance (r = .42, 
.05 level of confidence). The #R it 
will be recalled also predicted its own 
direction of change in the testing in- 
strument itself. 

b. The consultants’ “blind” a priori 
predictions of how well the subjects 
would stand up under stress (the h- 
ratings) correlated with the H-Crite- 
rion significantly better than chance 
(r = .43, .05 level of confidence) . 


Neither of these results can be as- 
sumed to be predictions of sleep loss ef- 
fects since: 

(1) The consultants’ h-ratings of the 
control subjects predicted their retest 
efficiency (H-Criterion) at the same 
level of confidence as for the sleep de- 
prived subjects. 

(2) One out of 100 score changes 
might well appear significant by 
chance, While the h-rating finding ap- 
pears to support the #R finding, es- 
pecially since the clinicians found the 
#R their most important single indi- 
cator of h-rating, it is supported not as 
a sleep deprivation effect but as a nor- 
mal predictor of test-retest change. 
Further, though the #R did not pre- 
dict the H-Criterion for the control 
subjects significantly better than 


chance, there was a tendency in that 
direction (r = .23). The fact that #R 
predicts the H-Criterion significantly 
better than chance for the experimen- 


tal and not for the control subjects 4 
pears largely a function of the signifi- 
cantly greater absolute amount of 
change in the experimental subjects on 
retest in both the projective and non- 
projective tests. 


2. Hallucination predictions: 


a. Only one of the projective test 
scores, the number of Beck popular re- 
sponses (#P), successfully differeni- 
ated the subjects rated as hallucinators 
by other staff members (Hallucination 
Criterion) from those not so rated (r 
= .62, .02 level of confidence). The 
#P it will be recalled also successfully 
predicted its own direction of retest 
change for the sleep deprived subjects 
in the testing instrument itself. 

Although again one out of a hun- 
dred relationships might easily appear 
significant by chance, it is not sur- 
prising that a relatively small #P (be- 
ow average perception of things as 
others see them) , which is one of the 
classic Rorschach “signs” of schizo- 
phrenia, should be a correlate of our 
“hallucinators.” It cannot be assumed, 
however, that the hallucinatory experi- 
ences of sleep deprived subjects are the 
same phenomenon as those of schizo- 
phrenics nor that it was primarily those 
with schizophrenic personality traits 
to start with who hallucinated with 
sleep loss. Eleven of the 16 men for 
whom hallucination ratings were avail- 
able were classed as hallucinators ac- 
cording to the criterion used here. 
While most (4 of the 5) subjects with 
schizophrenic features in their Ror- 
schach records were classed as hallu- 
cinators, the converse did not hold; 
most of the “hallucinators” did not 
give Rorschach records, baseline or 
sleep deprived, with prominent schizo- 
phrenic sigas. Nor were intelligence, 
task-involvement, or maintenance of 
efficiency levels, correlates of the hallu- 
cination classification; neither the 
ACB scores nor the h-ratings or H-rat- 
ing correlated significantly with the 
hallucination classification. 


b. The consultants classified only 
56% of the subjects correctly according 
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to the Hallucination Criterion used 
here, but they were referring to the hal- 
lucinations of schizophrenics. Each re- 
ported independently that she was pre- 
dicting that a subject would halluci- 
nate if and only if schizophrenic Ror- 
schach signs were prominent in his 
record. Four of the five subjects thus 
selected by them as probable hallucina- 
tors were so classed by the Hallucina- 
tion Criterion and the fifth was con- 
sidered a chronic denier and a proba- 
ble hallucinator by our colleagues. 


c. The authors, who agreed in their 
predictions as to whether each of the 
16 subjects would or would not be 
classed as a hallucinator, were 90% 
correct in their predictions, but they 
were predicting not just which subjects 
would hallucinate under stress, which 
was what the consultants were asked to 
do, but which subjects would be likely 
to report or reveal hallucinations in 
this setting. It seemed very unlikely 
that such a list would include all those 
and only those who would have hallu- 
cinatory experiences during prolonged 
wakefulness in any setting. Some of the 
latter might defensively deny them. On 
the other hand, among those who do 


report hallucinations during pro- 
longed wakefulness, some of the so- 
called “hallucinatory experiences” 


might easily be artifacts of interperson- 
al interaction; the subject has to cope 
not only with his disturbingly un- 
formed, unusual sensations, associa- 
tions, affect, etc., but also with the dif- 
ficulty of describing them to the inter- 
viewer. Thus even without the biases 
and anxieties of the interviewer, some 
subjects might be expected to resort to 
the anxiety-reducing maneuver of con- 
cretizing into “hallucinations” what 
were merely vague sensory disturb- 
ances. The fact that past and current 
reports of hallucinations during sleep 
deprivation was a matter of common 
knowledge to our subjects facilitated 
such concretizing tendencies. 

d. When they were later asked to 
base their hallucination predictions on 
the authors’ a posteriori descriptive 
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summary!‘ of the difference between 
those classed here as hallucinators and 
those not so classed, one consultant was 
69% correct, the other 81% correct. 
The two consultants agreed in 12 of 
the 16 cases (75%) and in 11 of the 16 
cases (69%) the authors and both con- 
sultants labelled the men the same in 
this respect. Thus the descriptive sum- 
mary, in spite of its generalities and 
wide use of non-specific terms, was use- 
ful, indicating that it is possible to pre- 
dict the subject’s hallucination classifi- 
cation reliably as well as significantly 
better than chance. 

However, the attempt to evaluate 
the relative importance of the descrip- 
tive summary components as predictors 
and to arrive at a more precise opera- 
tional definition by making a specific 


14 “Non-hallucinators” appeared to know what 
they were doing and gave “reality-bound” 
responses, the kind of banal, ordinary re- 
sponses usually given for the areas the sub- 
ject used. They usually gave healthy records 
from the point of view of controls and de- 
fenses. When they gave minus responses, 
they seemed less sure of these or showed 
other evidence of realistic critical discrimi- 
nation. 

“Hallucinators” showed one or more of the 

following: 

a. Personal, idiosyncratic material, odd or 
unusual content, or weird percepts 
emerged; the basic percepts may have 
been banal or plus but were personally 
elaborated, or the elaborations were clear- 
er than the basic percept or the person 
elaborated on his associations more than 
on the basic percept. 

b. The subject’s style of handling the test 
situation and his experiencing of his own 
ideas was an unclear event. Some sounded 
as if they were never quite sure what was 
expected, not sure but what they were just 
guessing all the time, unsure of their per- 
cepts or their terms; they showed impre- 
cise use of language; their ideas trailed off. 

c. The hysteric type personality was more 
apt to be classed as a “hallucinator” than 
the well-defended obsessive; i.e., not a 
clearly differentiated world to which he 
responds; a propensity for intrusive 
thoughts and affect to emerge into his 
thinking and be reacted to as if they were 
in the outer world. (Here not necessarily 
CF but openly noticeable in content, 
where a banal or pleasant content may 
end up dangerous or vice versa; incom- 
patible affect breaks through, not an in- 
tegrated part of the associative process. 
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checklist,1® securing diagnostic classifi- 
cations, and the rating of hysteric vs. 
obsessive defenses, fell far short of suc- 
cess. Not one of the consultant’s ratings 
or nosological classifications showed 
better than chance agreement either 
between consultants or with the ratings 
of the authors on this checklist. Nor 
did the consultants appear to be using 
the checklist, since in a few cases, their 
predictions as to whether or not the 
man would hallucinate ran counter to 
most of their ratings on the checklist 
and in even more cases, their ratings on 
the same man were about equally di- 
vided among scores which would have 
put the man in the hallucinator, the 
non-hallucinator, and undecisive class, 
according to our descriptive summary. 
Further, their cumulative ratings for 
each man on these various checklist 
items proved neither a valid nor a reli- 
able predictor. 


CONCLUSIONS AND SUMMARY 


1. Personality structure, as reflected 
in the projective tests used here (Ror- 
schach, Rosenzweig, Iflund and Glad 
tests), is unaltered by acute sleep de- 
privation, and personality functioning 
is changed relatively little. 

a. There was little test-retest change 
for the group as a whole, experimental 
as well as control, when consideration 
is given to the large number of scores 
and ratings statistically tested in this 
study. 

b. More of the sleep deprived than 
control subjects changed, and changed 
more, on retest, but for both groups it 
was the style and content of communi- 
cation and not the personality struc- 
ture which changed. 





** Rate each subject on a three-point scale as 
to: 

. Sureness of percepts. 

. Sureness of performance. 

Relative sureness about plus and minus 

responses. 

. Per cent of unusual content. 

. Per cent of personally elaborated con- 

tent. 

Presence of intrusive thoughts and af- 

fect. 

. Strength of defenses and controls. 


4 
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c. Where changes were found, they 
appeared to be a complex function of 

e interaction of the subject's long 
term personality adjustment with the 
whole experimental set up and not a 
function of sleep loss alone. 


d. The increased irritability alluded 
to in other sleep deprivation clinical 
reports was not found in this study. 
Our experiences suggest that some re- 
actions atttributed to sleep loss are 
largely a function of the experimental 
setting. 

2. Using the Rorschach test alone, it 
was possible to predict (a) how well 
the subject would maintain his effici- 
ency in a variety of mental and psycho- 
motor tasks during the sleep depriva- 
tion period, and (b) whether he would 
hallucinate. Although these prediction 
results were shown to be more con- 
vincing evidence of the predictability 
of behavior from personality structure 
than of sleep deprivation effects, the 
following information about the latter 
emerged: 


a. The subjects whose baseline Ror- 
schach performance showed high moti- 
vation, either because of inner stand- 
ards or strong drives for approval, 
tended to maintain or raise their men- 
tal and psychomotor efficiency levels on 
retest; those not so motivated tended 
to show retest decrement in a variety of 
non-projective psychological _ tests. 
While this relationships prevailed 
equally among both the experimental 
and control subjects, the decrement 
was accentuated by sleep deprivation. 


b. It was the loosely defended neuro- 
tics, whether experimental or control, 
who showed most test-retest change in 
personality functioning, but again, 
among the loosely defended neurotics, 
it was the sleep deprived subjects who 
showed greater incidence and degree of 
change. 

c. The men who reported hallucina- 
tions during prolonged wakefulness 
were not typically those who showed 
mental and psychomotor decrement 
nor those whose personality function- 
ing changed most, but usually fell into 
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one of the following two groups: (1) 
those characterized by a loosely defend- 
ed hysteric-like style of adapting and 
(2) those with prominent schizophre- 
nic traits. 
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The Structured Doll Play Test as a Projective Technique 
for Use With Children’ 


Davin B. LYNN AND ROSALIE LYNN 
University of Colorado School of Medicine 


Despite much doll play research lit- 
erature and evidence for the diagnos- 
tic usefulness of doll play procedures, 
no doll play technique has evolved 
employing simple, standardized, easily 

uantifiable characteristics of projec- 
tive tests. 

This paper describes the Structured 
Doll Play test (SDP)? which is de- 
signed specifically as a clinically use- 
ful doll play projective technique (8) . 
The paper also describes research re- 
sults suggesting the concurrent and 
construct validity of this instrument. 

The SDP was developed because 
clinical experience with children sug- 
gested the need for a projective test 
for use with children from three 
through 11 more suitable than those 
available. The first approach in de- 
veloping the SDP was to use dolls in 
the presentation of the Duess incom- 
plete fables (3, 9) rather than simply 
administering the fables orally. Duess 
orally presented incomplete fables as 
a method of studying resistance in the 
psychoanalysis of children between 
three and 15, The Duess fables sug- 
gested some of the features that are 
still retained in the test, although it 
has been modified so often that the 
similarity between the doll play situ- 
ations and the fables is now remote. 


The SDP consists of presenting the 





* This article is based on a paper of the same 
title read at the May, 1958 Rocky Mountain 
Psychological Association meeting in Santa 
Fe, New Mexico. The research findings were 
from Rosalie Lynn’s master’s thesis in the 
Department of Psychology, Purdue Univer- 
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to Dr. Janet Lyon for direction of the inves- 
tigation, and to Dr. William L. Sawrey, Uni- 
versity of Colorado School of Medicine, and 
Mr. James A. Norton, Purdue University, 
for statistical consultation. 


* Published by Test Developments, Box 8306, 
Denver, Colorado. 


S with doll-house dolls in a series of 
typical family and age-mate situations. 
Each situation is structured by telling 
the S what is going on, and he is al- 
lowed to resolve the situation through 
doll play. Thirty to forty-five minutes 
are usually required in administering 
the test. Children characteristically 
enter wholeheartedly into the play 
and give clinically rich responses. 


Following is a list of the situations 
included in the SDP with brief state- 
ments of their content: 
1A Crib or Bed Choice. S is asked 
to decide whether the ego-doll 
wishes ‘to sleep in the bed or 
crib, both of which are pre- 
sented in the arrangement. 

1B Bottle or Cup Choice. S decides 
whether the ego-doll will drink 
milk from a cup or bottle. 

2A Peer Choice. § has the ego-doll 
choose whether he (or she) 
wishes to be with a boy doll or 
girl doll. 

2B Visitor. The unchosen doll in 

Situation 2A is presented as vis- 
iting the ego-doll, and S elabo- 
rates what happens. 
8A Fighting. Two same-sex dolls 
are said to be fighting. S resolves 
the situation and is asked to tell 
what they are fighting about. 

3B Misbehavior. Me orga chil- 
dren are said to have done some- 
thing wrong. S is asked to ex- 
plain. 

4A Feeding. S chooses and elabo- 

rates on which parent feeds the 
ego-doll. 

4B Injury. The ego-doll is hurt. 

The kind of injury and its seri- 
ousness are elicited. 

5A Pursuit. § chooses which parent 

is running after the ego-doll and 
explains the circumstances. 
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5B Reprimand. S$ chooses the par- 
ent who is to be the one scold- 
ing the ego-doll. 

6. Parental Relationship. S$ tells 
what happens in a situation in- 
volving the parent dolls alone 
together. 

7A Separation. The parent dolls 
leave the ego-doll alone for a 
while. S tells what happens. 

7B Hospital. The ego-doll is said 
to be hospitalized. Reasons for 
this and consequent events are 
elicited. 

8A Toilet, S chooses which parent 
will take the ego-doll to the 
toilet. The ego-doll may use 
either toilet or potty chair. 

8B Bathing. Opposite-sex child 
dolls are bathing together, and 
S relates what happens. 

9. Sleeping Arrangement. A double 
and a single bed are presented, 
and § puts the parent dolls and 
the ego-doll to bed. 

10A Nightmare. The ego-doll is said 
to have a bad dream. S elabo- 
rates on this and is asked for 
content and resolution. 

10B Night Fears. The ego-doll is 
said to be afraid of something 
he sees or hears during the 
night. S elaborates on this. 

Most of the SDP situations also in- 
volve one or both parents, often call- 
ing for a choice on the part of S. 

Situations 1A, Crib or Bed Choice, 
and 8A, Toilet, are presented below 
to illustrate more fully the nature of 
the SDP situations. In Situation 1A, 
S is told, “The little boy (or girl, the 
ego-doll is determined by the sex of 
S) has two places he (she) can sleep. 
He (she) can sleep in either the crib 
or the bed. You put him (her) to 
sleep.” S is then asked, “Who comes 
in, Mommy or Daddy?” and, “Then 
what happens?” 

In 8A, S is asked, “Which one 
takes him (her) to the toilet, Mommy 
or Daddy?” S is then asked, “Which 
one does Mommy (Daddy, depending 
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on the choice) take him (her) to, the 
toilet or potty chair?” 

A ae, conducted by Lillie and 
Lillie (5), suggests that a structured 
doll play approach may have the ad- 
vantage of eliciting more clinically 
useful material than free doll play. 
Using eight- through 12-year-old male 
subjects, they —— the effects of 
a structured doll play approach 
(though not the SDP) with free doll 
play using comparable materials. 
They found that the structured ap- 
proach elicited significantly more in- 
cident agreements than free play. By 
incident agreement is meant any obvi- 
ous parallel between responses in the 
doll play and a parent interview. 
Structured doll play also elicited sig- 
nificantly more identification of the 
Ss with the dolls, more hostility, and 
more parental discipline responses. In 
addition, the Ss tended to accept the 
structured play better than the free 
play. The material, which was better 
elicited by the structured procedure, 
would seem to be clinically useful. 

The SDP uses the standard doll fam- 
ily (mother, father, child) rather than 
a duplicate of the child’s own family 
constellation. This is done since Rob- 
inson (11) found that variation in 
the doll family constellation had little 
effect on the scored variables with the 
exception that identification with the 
dolls is greater when the child’s own 
family is duplicated by the dolls, The 
kind and amount of aggression did not 
differ significantly. In a clinical (rath- 
er than research) setting, it is not 
always possible to obtain valid infor- 
mation concerning the family constel- 
lation necessary to duplicate the 
child’s family with dolls. It is felt that 
the gain in standardized procedure 
and in simplification by using the 
standard family outweighs the loss in 
the S’s identification with the dolls. 

The 10 SDP situations consist of 
eight which are divided into two com- 
parable sub-situations, thus making 18 
situations in all. In each of these eight 
situations, so divided, the S is pre- 
sented with a choice of dolls, seven 
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of mother or father, one of the boy or 
girl. Ammons and Ammons (1) con- 
ducted a study which suggests that 
structuring parent-choice situations 
might be a useful projective device. 
They found adequate reliability in 
consistency of choice, and they discov- 
ered significant age and sex differ- 
ences in parent preference. The na- 
ture of the choices conformed so well 
to cultural patterns that some evi- 
dence for the validity of the doll play 
parent-choice technique can be in- 
ferred. 

In the SDP, after the S has made a 
choice in a given situation, he is ad- 
ministered a comparable situation 
using the parent doll which was not 
chosen in the original one. In this 
way the S has equal opportunity to 
fantasy about both parents. For ex- 
ample, if the mother doll is chosen to 
take the ego-doll to the toilet in Situ- 
ation 8A, Toilet, the father doll is 
introduced in Situation 8B, Bathing. 
There is an advantage over free doll 
play in this regard since, in free play, 
a child sometimes concentrates all his 
fantasy on one parent doll, ignoring 
the other completely. 

The SDP has eliminated the elab- 
orate setting, the doll-house rooms, 
the suite of furniture, etc., typical of 
traditional doll play procedures. All 
of the SDP materials can be contained 
in a small carrying case. The test can 
be administered on a small space such 
as the typewriter leaf of a desk. It is 
felt that having so few materials re- 
duces the exploratory and tangential 
behavior which plagues traditional 
doll play procedures using elaborate 
materials. To reduce distraction, only 
the materials that are used in a given 
situation are placed before the S, the 
others being kept in the closed carry- 
ing case. 

The use of doll furniture that is 
crudely representative rather than 
realistically detailed, is based on find- 
ings in a study by Phillips (10). Phil- 
lips found that the use of ambiguous 
doll figures and block-like toy furni- 
ture, constructed crudely to represent 
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items rather than in realistic detail, 
reduces exploratory behavior. How- 
ever, extremely ambiguous materials 
are not used in the SDP because the 
nature of the test requires the S to 
recognize the furniture (e.g. toilet or 
potty) and to recognize the doll figures 
(e.g. mother or father doll) . 

It is felt that a structured doll play 
procedure is less conducive to tangen- 
tial and exploratory behavior than 
free doll play in that it focuses the 
child more closely on the clinically 
rewarding fantasy. 

The SDP differs in many ways from 
the Driscoll (2) and Whitmore (4) 
doll play kits. The Driscoll kit claims 
only to furnish standard materials, a 
23 x 20 inch, five-room apartment, 27 
pieces of furniture, etc., but it does 
not furnish structured situations. The 
Whitmore kit offers standard mate- 
rials and also suggests a number of 
incomplete stories which might be 
used with the kit. It does not include 
choice situations, nor is it designed as 
a standardized projective test as is the 
SDP. 


SDP RESEARCH 


Fortunately, the period in Clinical 
Psychology is virtually ended when a 
psychologist can originate a test and 
place it on the market without sup- 
pans research. Some research has 

en completed with the SDP and 
more is currently under way. The fol- 
lowing report of two studies offers re- 
sults that suggest areas of validity. 

In the first study (12), there were 
80 mother-child pairs from a typical 
sailor district in Norway.? There were 
40 mother-child pairs in a father-ab- 
sent group (21 boys and their moth- 
ers, 19 girls and their mothers). In 
this group the fathers were sailors and 
were away from home at least nine 
months out of the year and some- 
times as long as two years or more, 
with only infrequent and irregular pe- 





* The senior author collaborated with Mr. 
Tiller in conducting this research at the In- 
stitute for Social Research in Oslo, Norway. 
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riods at home. There were also 40 
mother-child pairs in the control 
group (20 boys and their mothers, 20 
girls and their mothers). The chil- 
dren’s ages ranged from eight through 
nine years six months. The two groups 
were comparable in the teacher’s rat- 
ing of the child’s intelligence and in 
the father’s occupational status. Both 
groups were essentially middle-class, 
the sailors all being officers. The chil- 
dren were administered the SDP and 
their mothers a semi-structured inter- 
view. 


In this study of father-absence in 
Norwegian sailor families, construct 
validity in the SDP is suggested by the 
fact that the findings support a theo- 
retical formulation of the effects of 
father-absence. A number of predic- 
tions were explicitly stated prior to 
the data collection, most of which 
were supported by the SDP results. 
For example, it was predicted, and 
supported by SDP findings, that moth- 
ers of father-absent children would be 
more over-protective than mothers of 
father-present children, that father-ab- 
sent children would respond with de- 
pendence, and that boys would be 
more strongly affected by father-ab- 
sence than girls. 


Concurrent validity is suggested by 
the fact that the SDP discriminated 
between groups, i.e. the results differ- 
entiated between the sexes, and be- 
tween father-absent and father-present 
groups. For example, a significantly 
higher proportion of girls than boys 
chose the mother doll more often than 
the father doll. Interestingly, a signifi- 
cantly higher proportion of father-ab- 
sent than father-present boys chose the 
father doll more often than the moth- 
er doll, and the crib rather than the 
bed. A significantly higher proportion 
of father-present boys gave aggressive 
doll play responses between age-mate 
dolls than did father-present girls. A 
significantly larger proportion of fath- 
er-absent than father-present girls gave 
high dependency responses in the 
SDP. 
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The following is a report of find- 
ings concerning the SDP responses of 
four- and six-year-old children. 


Subjects 


The subjects were 49 middle-class 
children from a private school in In- 
dianapolis. There were 26 four-year- 
olds (12 boys, 14 girls) and 23 six- 
year-olds (11 boys, 12 girls). 


Scoring Procedures 


The SDP was scored for dependency, 
mature-immature choices, and parent 
doll choice. The child’s dependency 
score was determined by the propor- 
tion of dependent to independent re- 
sponses in the SDP. A response was 
scored dependent if the child doll was 
shown to seek help, sympathy, conso- 
lation, support, attention, affection, 
protection, or ate closeness. It 
was also scored dependent if the child 
doll was shown as immature, e.g. by 
crying. Independence, on the other 
hand, was scored if the child doll 
showed self-sufficiency or self-assertion, 
e.g. by carrying out the major part of 
an action by himself. The agreement 
between independent ratings by two 
judges on this variable was 92%. 

In the mature-immature coding, a 
mature choice was scored for bed 
rather than crib and also for cup 
rather than bottle. Conversely, crib or 
bottle were the two immature choices. 

In the early modification of the 
SDP used in the present study, there 
were only two possible parent choices 
rather than the eight parent choices in 
the current revision.* In the Play Sit- 
uation the S chose the parent doll with 
which the ego-doll wished to play a 
game. In the Construction Situation 
the S chose the parent doll from 
which the ego-doll wished help in 
building something with blocks. 


Results 
In the statistical treatment, analysis 
‘Dr. Bertram D. Cohen, Lafayette Clinic, De- 


troit, Michigan, collaborated in the ——. 
ment of this early modification of the SDP. 
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of variance tested results for depend- 
ency. Chi-square was used in all other 
tests of significance with the exception 
that Fisher’s exact test was used in 
one analysis of the data concerning 
bottle-cup choice where the frequen- 
cies were especially low. 

Concurrent validity in the SDP is 
suggested in this study by the differ- 
entiation between groups (age and sex 
comparisons). Construct validity is 
suggested by the logical relationship 
between these findings and develop- 
mental patterns, 

Table I indicates a significant age 
difference for dependency scores. 
Table II indicates that, as expected, 
four-year-olds received higher depend- 


TABLE I—Age and Sex Differences 
for Dependency Scores 








Source df MS F 
Sex 1 047 <i 
Age 1 426 6.45* 
Sex x Age Interaction....... 1 518 7.84* 
aE ae 42 .066 


* Significant at 5% level of confidence. 


TABLE IJ—Sum of Dependency 


Scores 
Age 4 Age6 Total 
on eI ROOD 10.15 3.84 13.99 
EET ENS 9.62 7.77 17.39 
, eee 19.77 11.61 31.38 


TABLE III—Percentage of Immature 
Choices (Crib over Bed, 
Bottle over Cup) 


No One Two 
Imma- Imma- Imma- 
ture ture ture 
Choice Choice Choices N 
Boys—Age 4 ........ 8 33 58 12 
Boys—Age 6 ........ 55 27 18 11 
Girls—Age 4 ........ 36 29 36 14 
Girls—Age 6 ........ 33 50 17 12 


Note: Rounding errors prevented the sum of 
percentages from totaling 100 in all rows in 
Tables III and V. 
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ency scores than six-year-olds. Table I 
also indicates a significant age-sex in- 
teraction. Table II shows no appreci- 
able difference for dependency scores 
between four-year-old boys and girls, 
but six-year-old boys received lower 
dependency scores than the girls and 
than the four-year-old boys. 

In Table III, there was a trend, al- 
though not statistically significant, for 
a higher proportion of four-year-olds 
to make immature doll play choices 
than six-year-olds (p.—.08) . 

Table IV shows that the greater 

roportion of four-year-olds making 
immature choices was primarily ac- 
counted for by the bottle choice 
rather than the crib choice. A signifi- 
cantly higher proportion of four-year- 
olds than six-year-olds chose the bot- 
tle (p=.01). In this table a signifi- 
cantly higher proportion of four-year- 
old boys chose the bottle than four- 

year-old girls (p=—.05) . 

In Table V, there are significant 
age-sex differences in parent doll 
choice (p=.02). A higher proportion 
of six-year-olds chose the same-sex par- 
ent doll than four-year-olds, i.e. a high- 
er proportion of six-year-old boys than 
four-year-old boys chose the father 
doll, and a higher proportion of six- 
year-old girls than four-year-old girls 
chose the mother doll. 

An inspection of Table V indicates 
that the six-year-olds chose the parent 
doll more discriminately than the 
four-year-olds. A smaller proportion of 
six-year-olds than four-year-olds chose 
the mother for Situation 10, Construc- 
tion, choosing the father instead, and 
a larger proportion chose the mother 
for Situation 15, Play. The six-year- 
olds seem to have decided that it is 
the function of fathers not mothers to 
help one build things. 

Thus, as would be predicted on the 
basis of the usual pattern of child de- 


TABLE IV—Frequency of Mature-Immature Choices 





Boys — Ages 4 
Boys — Age 6 





Girls — Age 4 





Girls — Age 6 


Crib Bed Bottle Cup N 
8 4 10 . 2 
5 6 9 11 
8 6 6 8 14 
8 4 














340 


The Structured Doll Play Test 


TasLE V—Percentage of Mother Doll Choices in Situation 10 (Construction) 
and Situation 15 (Play) 


Neither 


Boys — Age 4 42 
Boys — Age 6 64 
Girls — Age 4 50 
Girls — Age 6 8 














velopment, the younger children gave 
more immature SDP responses than 
the older children. Perhaps it would 
not be so readily predicted that a 
higher proportion of the older than 
the younger children would choose 
the same-sex parent doll. However, if 
one conceives of parent doll choice as 
representing the parent toward which 
the child strives to identify, then 
these results seem reasonable. That is, 
as the child grows older he strives 
more strongly to identify with the 
same-sex parent. This interpretation 
of parent doll choice would help ac- 
count for the fact that, in the father- 
absent study reviewed above, a higher 
proportion of father-absent than fath- 
er-present boys chose the father doll 
more often than the mother doll (12) . 
Perhaps the father-absent boy, virtu- 
ally lacking a father, strives all the 
more to identify with the absent 
father, and consequently tends to 
choose the father rather than the 
mother doll. 


CaAsE ILLUSTRATIONS 


The following case illustrations are 
presented to give the reader some fa- 
miliarity with the quality of SDP re- 
sponses, some notion of the kinds of 
material the clinician using the SDP 
has to work with, and an impression 
of the nature of the relationship some- 
times found between SDP responses 
and data obtained from other sources. 
In order to cover a wide variety of il- 
lustrative responses, brief excerpts are 
taken from a number of SDP proto- 
cols rather than treating one record in 
depth. The resulting over-simplifica- 
tion is not intended to imply that this 
instrument should be used in a “cook 
book” fashion. Neither are these il- 
lustrations presented as interpretive 


Situation 10 Situation 15 Both 
33 17 
0 36 
29 21 
17 58 1 


guides, for that is beyond the scope of 
this particular paper. 

The Ss used in these case illustra- 
tions are drawn from two sources. One 
group of Ss were administered the 
SDP as part of an evaluation at a uni- 
versity medical center child guidance 
clinic (henceforth referred to as Child 
G Ss). They were referred because of 
psychological disturbances. The sec- 
ond group were administered the SDP 
as part of a research project at a uni- 
versity medical center pediatrics out- 
patient clinic (henceforth referred to 
as Ped OP Ss). No S was included in 
the Ped OP group who came to this 
university medical center pediatrics 
out-patient clinic because of a psycho- 
logical disturbance, but only those 
who came because of physical illness. 
Most of the illustrative material com- 
ing from sources other than the SDP, 
used for comparison with the SDP re- 
sponses, was obtained from a semi- 
structured interview conducted with 
the mothers, In some of the Child G 
cases, material was also obtained from 
the social worker’s interview with the 
parents and from the psychiatric in- 
terview with the child. 

The first three examples concern 
various mature-immature choice pat- 
terns (cup- bottle, bed -crib, toilet - 
potty). In actual clinical use one 
would never consider the choices in 
isolation as is done here for simplic- 
ity’s sake. 

The first S for consideration is 
Child G subject A. Subject A is an 
eight-year-11-months old boy display- 
ing infantile speech and immature be- 
havior, such as crying easily. A con- 
siders boys too rough and plays main- 
ly with girls. He is quite close to the 
mother and relies heavily on her for 
support, guidance, and protection. 


PC a eT 





Davin B. LYNN AND ROSALIE LYNN 


A, despite his age, made all imma- 
ture choices on the SDP, viz. crib, bot- 
tle, and potty. 

Ped OP subject B is a four-year- 
eight-months-old girl referred because 
of recurring upper-respiratory infec- 
tions. The mother did not feel that B 
feared the doctor, and reported that 
B, on her own, made an effort to be 
especially clean in preparation for the 
physical examination, whenever she 
was to see the doctor. The mother re- 
ported that B is content to play alone, 
but, on the other hand, adjusts well 
to groups also. She referred to B as an 
“easy going child.” 

This seemingly mature girl, al- 
though much younger than A, made 
all mature choices, viz, bed, cup, and 
toilet. 

Child G subject C is an encopretic 
seven-year-11-months-old boy who, ac- 
cording to the mother, “started hav- 
ing trouble with his BMs just before 
he started going to school.” 

C made two mature choices (bed 
and cup). However, the one imma- 
ture choice that this encopretic boy 
made was the choice of potty instead 
of toilet. 

The next three examples illustrate 
patterns of parent doll choices. In the 
study with the SDP, which was treat- 
ed in greater detail above, it was 
found that a significantly higher pro- 
portion of father-absent than father- 
present Norwegian children chose the 
father doll more often than the moth- 
er doll (12). In this study the absent 
fathers were Norwegian sailors whose 
occupation kept them away from 
home for long periods of time. Clini- 
cal experience suggests the hypothesis 
that high father doll choice accom- 
panies father-absence in other than 
the specific conditions surrounding 
Norwegian sailor families. High father 
doll choice seems particularly likely 
where father-absence is coupled with 
maternal rejection, 

Ped OP subject D is a five-year-11- 
months-old girl whose epileptic sei- 
zures are well controlled. She is an 
illegitimate child who has never lived 
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with her father. The mother admits 
hostile feelings toward this unwanted 
child. When D was asked, “What do 
you think makes children sick?” she 
replied, “Because they don’t like their 
mothers.” 

D chose the father doll in seven out 
of eight possible choice situations.® 
This proportion of father-doll choices 
is very rare, particularly with girls. 

Ped OP subject E is a 10-year-three- 
months-old hemophiliac boy who has 
been separated from his real father 
since age two due to divorce. A step- 
father has been in the home since the 
boy was six years old. The mother’s 
strong feelings about E in connection 
with his illness are characterized by 
her comments, “It’s all my fault. I’m 
the carrier. I shouldn’t have had him 
in the first place. I’ve often had the 
idea, well, maybe he would be better 
off dead.” 

E chose the father doll seven out of 
eight possible times. In Situation 9, 
Sleeping Arrangement, E placed the 
boy and the father in the double bed, 
and the mother doll alone in the 
single bed. 

Child G subject F is a nine-year-10- 
months-old boy who was referred for 
infantile speech. The father, in this 
case, is in the home. The mother re- 
ported that the child demands a great 
deal of attention from her. He, in 
turn, brings the mother candy. When- 
ever the mother kisses his 15-year-old 
brother, F asks her whether she loves 
him as much as she does his brother. 


F chose the mother doll in all eight 
choice situations. 

Ped OP subject G is used to illus- 
trate peer choice in Situation 2A, 
Peer Choice. G is a six-year-11-months 


® There are seven SDP situations in which the 
S is presented with the choice of mother or 
father doll. Parent choice is also scored in 
Situation 9, Sleeping Arrangement, making 
8 possible choices in all. In this situation the 
parent doll placed in the same bed with the 
ego-doll, is considered the chosen parent. In 
addition, when the ego-doll is placed alone 
in the single bed, the parent doll placed on 
the side of the double bed next to the ego- 
doll is considered the chosen parent. 
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old boy referred because of an upper- 
eo infection, 

’s mother reported that he likes to 
dress up in dresses and go to his girl 
friend’s house. “He could play by the 
hour with her.” 

G chose the girl doll in the Peer 
Choice situation. 

Sometimes a specific situation will 
illuminate a specific problem. Child 
G H is an eight-year-nine-months-old 
girl who was referred because of en- 
copresis. The mother reports that “she 
don’t seem to know even after her 
bowels have moved.” 

H gave the following response to 
Situation 8A, Toilet: 

“Mommy takes the little girl to the 
potty and she can’t go. She doesn’t 
need to, but Mommy makes her go 
anyhow.” 

Typically, as the next case illus- 
trates, a particular dynamic pattern 
may be reflected repeatedly through- 
out the test protocol, and in a variety 
of forms. 

Ped OP J is an 11-year-old epileptic 
boy whose mother is blatantly over- 
protective. She so infantilizes J that 
the father complains that she “packs 
I around on a silver platter.” Her pro- 
tective feelings are reflected in her re- 
lating an incident in which J feared 
his hair would be cut in preparation 
for an EEG examination. The mother 
related that she reassured J that they 
would not cut his hair. The mother 
exclaimed to the interviewer, ‘No, 
my gracious, they couldn’t mess that 
curl up. That would be tragic.” 

This 11-year-old boy reflected over- 
protection in Situation 1A, Crib or 
Bed Choice, by responding as follows: 

I chose the crib and said, “He 
looks like a very young boy.” He 
then made the mother choice and 
said, “Mother usually comes in first 
and checks the child. She has it say a 
prayer and kisses it goodnight.” 

In 8A, the Toilet Situation, J re- 
sponded as follows: 

I chose the potty. He said, “Mommy 
would unstrap his (the boy doll’s) 
pants and he’d have to go. She would 
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strap his pants again and wash his 
hands for him.” 

Situation 9, Sleeping Arrangement, 
was as follows: 

The mother doll, rather than the 
father doll, was placed in the double 
bed on the side closest to the boy doll 
in the single bed. Then 7 said, “He 
(the boy doll) wants to go to the rest 
room or wants water. ‘Mother, moth- 
er, I want water.’ She goes and gets it 
for him.” 

In another situation the boy doll 
was said to cry, and in another to ask 
the mother doll whether she brought 
him a toy from town. 

J is a four - year - five - months - old 
Child G boy brought to the clinic by 
the paternal grandmother because of 
infantile speech. The grandmother 
said that she has taken care of J since 
his birth because his mother (her 
daughter-in-law) neglected him, e.g., 
“She wouldn’t even give him his bot- 
tle.” The grandmother describes the 
mother’s “temper tantrums” during 
which she has become so angry that 
she tears buttons off clothes. J was de- 
scribed as being so afraid of seeing his 
mother during her infrequent visits 
that he would bite his fingernails 
when he saw her approaching. 

J’s SDP responses show both oral 
preoccupation and fear of the mater- 
nal figure. In Situation 7B, Separa- 
tion, J gave the following response: 

“Boy hide. Mommy come back. She 
eat that boy. He all gone. She eat 
daddy. He all gone.” 

Fearful oral preoccupation was re- 
flected in two additional situations, In 
Situation 9, Sleeping Arrangement, he 
said, “Bogy man eats them up,” and 
in Situation 10B, Night Fears, J re- 
sponded, ““Bogy man ate the little boy 
up.” 
Subject K is a nine-year-six-months- 
old Ped OP girl who was referred be- 
cause of vulval vaginitis. Four years 
previously, when she was five, a baby 
brother was born to whom she showed 
extreme hostile rivalry, The mother 
reported, “She (subject K) said I 
couldn’t bring him (the baby) home 
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from the hospital. When we got home 
we couldn’t keep her in the same 
room with him. She would bite him 
or pinch or slap him all the time if 
she got the chance. She’d try to push 
him off my lap. If she got scissors she 
would cut the nipple off his bottle.” 


The mother reported that K and 
her brother now get along “wonder- 
ful.” 


However, in SDP Situation 5B, Rep- 
rimand, K said: 

“Well, she comes home and fights 
with her baby brother. Daddy says, 
‘Don’t fight with him.’ He makes her 
go to bed. He (brother) is only one- 
year-old. She lays there and cries, and 
father tells her not to cry, and they all 
go to bed. She started crying for her 
mother, and mother says, ‘Be quiet,’ 
and she says, ‘No.’ The girl hit her 
father and mother. Mother spanked 
her, and she shoved her brother out, 
and her father and mother spanked 
her again.” 


Situation 6, Parental Relationship, 
is structured so that the parent doll 
are alone. After S relates what tran- 
spires while the parents are alone, the 
child doll is introduced. At this point 
K said: 

“The girl tells her father she got all 
A’s today. Then she tells mother, ‘I 
got all A’s today.’ Mother and father 
say, ‘O.K.’ They ask the girl where her 
brother is. The brother tells mother 
and father he got all A’s today. The 
girl asks her mother if she likes her, 
and mother says ‘no’ so she’ll cry.” 


In Situation 7A, Separation, K said, 
“She goes to the neighbors and the 
little boy asks, ‘Where’s your brother?’ 
She starts fighting.” 

K, in this situation, chose the moth- 
er as the parent doll she wished to re- 
turn to the ego-doll first. K said: 

“The mother wonders where the 
little brother is, and the girl says, ‘I 
don’t know.’ The girl says, ‘Give me a 
drink.’ Mother gives her a drink and 
she throws the glass on the floor and 
goes to bed.” 
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SUMMARY 


This paper described the Structured 
Doll Play test (SDP), which is de- 
signed specifically as a projective doll 
play technique for use with children. 
In this paper, literature was reviewed 
which suggests that a structured ap- 

roach may elicit more clinically use- 
ful material than free doll play, and 
that the SDP has been developed in 
keeping with doll play research find- 
ings. 

Research findings from a study of 
father-absence with the SDP which 
suggest concurrent and construct va- 
lidity were reviewed. An investigation 
of the SDP responses of 49 four- and 
six-year-old middle-class children was 
presented. 

Concurrent validity in the SDP is 
suggested in this study by the dis- 
crimination between groups (age and 
sex comparisons) . 

A significantly higher proportion of 
four-year-olds than six-year-olds_re- 
ceived high dependency scores. A sig- 
nificant age-sex interaction showed six- 
year-old boys particularly low on de- 
pendency. Four-year-olds tended to 
make more immature doll play choices 
than six-year-olds. A significantly high- 
er proportion of four-year-old boys 
than four-year-old girls chose the bot- 
tle. There were significent age-sex dif- 
ferences in parent doll choice, with 
a higher proportion of six-year-olds 
than four-year-olds choosing the same- 
sex parent doll, and choosing the par- 
ent doll more discriminately. 

Construct validity is suggested by 
the logical relationship between these 
findings and developmental patterns. 

Research and clinical results suggest 
that the SDP may well meet the re- 
quirements of a useful projective test 
with children. 
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Relationship between Story Length and Situational 
Validity of the TAT’ 


CAROLINE TAYLOR MACBRAYER 
Mooresville, N. C. 


Significant changes in Thematic Ap- 
ene me Test (TAT) phantasy 
ave been reported following experi- 
mentally produced increases in aggres- 
sion (3), depression (5, pp. 454-455), 
sexual arousal (4), hunger (2), 
achievement motivation (15, 16), and 
affiliation motivation (1, 19). In scor- 
ing TAT stories to reveal these 
changes, which theoretically result 
from Dp of experimentally 
aroused motivation into phantasy, va- 
rious scoring methods have been used. 
Bellak (3, 5, pp. 445457) simply 
counted the number of aggressive 
words in TAT stories under condi- 
tions of no criticism and under condi- 
tions of criticism of Ss’ stories by E, 
and also counted the number of de- 
ressed words and phrases under post- 
ypnotically induced feelings of de- 
pression, as compared to normal. No 
account was taken of length of stories, 
data being in terms of absolute num- 
bers of aggressive words or depressed 
words and phrases, rather than in 
terms of proportions of total words. 


Milam (17) has demonstrated that 
male college students tell significantly 
longer TAT stories when the examiner 
is “as demanding, rejecting, and 
threatening to the subject as possible” 
(17, p. 222), or “as accepting, courte- 
ous, and friendly as possible” (17, p. 
222), as compared to neutral and - 
tions in which the examiner is efficient, 
routine, and noncommital, mean num- 
bers of words per story being 162.80, 
159.53, and 112.48, respectively (17, 
p. 224). Milam’s results strongly sug- 
gest that increased motivation results 
in increased length of TAT stories, 
and with increased story length, the 
frequency of almost any type of word 





*This research was conducted at Davidson 
College. 


or thema might well be expected to 
increase. In order to demonstrate the 
situational validity of the TAT, there- 
fore, it is necessary to show that an in- 
crease in the specific type of phantasy 
experimentally induced occurs inde- 
pendently of increased story length. 


McClelland et al. (15, 16) have at- 
tempted to control the factor of story 
length in their work on arousal of the 
achievement motive by limiting TAT 
stories (written) to five minutes each. 
Average story length under these con- 
ditions was found to be about 90 words 
(15, p. 404), and these authors feel 
that “.. . coritent of the stories is not 
systematically related to their length” 
(16, p. 212). Relevant data presented 
are mean words per story for 28 
achievement-oriented high school sub- 
jects (M = 77.2) as compared to 18 
neutral similar high school subjects 
(M = 68.3), the difference between 
the means not reaching statistical sig- 
nificance at the 5% level (16, p. 212). 
Similar time limits were placed on 
TAT stories in experimental arousal 
of hunger (2), the affiliation motive 
(1, 19), and sexual motivation (4). 
Variation in story length is certainly 
reduced by a five-minute time limit, 
but whether this is an adequate con- 
trol of story length, or even a desirable 
one if longer stories are actually more 
revealing of needs and desires?, is ques- 
tionable. Projection of experimentally 
aroused achievement (15) and affilia- 
tion motives (1, 19) was measured by 
frequency counts of items which in- 
creased significantly under arousal 
conditions. In Clark’s (4) scoring of 
TAT stories for manifest sex imagery 





* Length of TAT stories which are unrestric- 
ted by a time limit has been used in some 
studies (10, 14) as a criterion of excellence 
of performance. 
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after experimentally induced sexual 
arousal, sexual activity was scored just 
once for each story. In scoring, priority 
was given to that type of imagery in 
each story which was biologically most 
sexual, the three types being: “ (a) 
primary—explicit or implicit evidence 
for sexual intercourse, (b) secondary 
—evidence for the occurrence of such 
secondary sex activity as kissing, danc- 
ing, or fondling, (c) tertiary—charac- 
ters in the stories perceived as sweet- 
hearts, on a date, courting, in love, and 
so on, but not engaged in either pri- 
mary or secondary sexual activity” (4, 
p- 47). Symbolic sexual imagery, on 
the other hand, was scored by Clark 
(4) for each story as “strongly” pres- 
ent (weight of 2), “some” symbolism 
present (weight of 1) , or none (weight 
of 0), depending on to what extent 
themas which could be interpreted as 
sexually symbolic were present. Clark’s 
scoring thus gives frequency of sexual 
imagery more weight in the symbolic 
than in the manifest sex score, varia- 
tions in story length being thereby rel- 
atively more important in the sym- 
bolic score. 

The present experiment was de- 
signed to determine whether criticism 
results in a significant increase in 
length of TAT stories, and if so, 
whether increase in aggressive phan- 
tasy, as measured by numbers of ag- 
gressive words in the stories, is chiefly 
a function of increased story length, 
rather than being due to projection of 
increased aggression which has been 
experimentally induced by criticism. 
For purposes of this experiment the 
assumption was made that criticism 
does arouse aggression, but no assump- 
tion was made concerning the nature 
of the motivational variable or vari- 
ables intervening between criticism 
and increase in TAT story length. It 
was assumed that if increase in aggres- 
sive words with criticism is primarily 
a function of increased length of TAT 
stories with criticism, four other types 
of phantasy, presumably unrelated to 
the experimentally induced motiva- 
tion of aggression, would show similar 


Story Length and Situational Validity of the TAT 


increases and that frequency of each 
of the five types of phantasy would cor- 
relate significantly with story length. 
The specific hypotheses to be tested are 
as follows: 

1. Criticism significantly increases 
the length of TAT stories. 

2. While criticism increases the abso- 
lute number of aggressive words in 
TAT stories, it does not increase the 
proportion of aggressive words rela- 
tive to the total number of words in 
TAT stories. 

3. Increase in number of aggressive 
words in TAT stories with criticism is 
due to increase in length of TAT 
stories with criticism, rather than to 
the projection of experimentally 
aroused aggression into phantasy. 


METHOD 


Subjects were 20 psychologically 
naive college freshmen, 10 males? and 
10 females, It was assumed that Ss 
would more freely project socially dis- 
approved feelings of hostility and ag- 
gression into TAT phantasy in the 
presence of student Es than a faculty 
E (18), so two male upperclassmen 
served as Es* throughout the experi- 
ment. Only the two Es and the particu- 
lar S being tested were in the room at 
any time during the experiment, The 
procedure was similar to Bellak’s (3), 
each § telling six TAT stories, the first 
three under normal or no-criticism 
conditions, and the last three under 
conditions of criticism by the Es. The 
TAT pictures used were: 18GF, 
18BM, 13MF, 3GF, 10, and 3BM, this 
order being used with one-half of the 
Ss of each sex, and the reverse order 
with the other half. Instructions were 
as follows: 

This is a test of imagination, one form of 


* Data for one male S are not included under 
Results (N=19 instead of 20), because 
length of his stories under the first condi- 
tion, without criticism, fell above the upper 
limit of the range for numbers of words 
per story reported by Dana (6) for psy- 
chotics. 

“The author wishes to express appreciation 
to L. M. Mayfield and E. I. Stevens, who 
served as Es. 
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intelligence. I am going to show you some 
pictures, one at a time, and your task will 
be to make up as dramatic a story as you 
can for each. Tell what has led up to the 
event shown in the picture, describe what is 
happening at the moment, what the charac- 
ters are feeling and thinking, and then give 
the outcome. Speak your thoughts as they 
come to your mind. No other person will 
know who has told what story, so do not be 
afraid to speak openly and frankly. Do you 
understand? 


The same E made some encouragin; 
remark to each S after the S complet 
one practice story in response to TAT 
picture 7GF before the experiment be- 
gan, and also after S’s first and second 
stories during the experiment. These 
remarks were “good,” “fine,” “O.K!” 
(said approvingly) or “That’s the 
idea!” After the third, fourth and fifth 
stories, criticism patterned after that 
of Bellak (3, p. 361) was made of S’s 
stories as follows: 


[After the third story], “Your stories are 
not as good as they could be. They are illogi- 
cal and unstructured. You lack imagination 
and ideas in your stories. Try and see if 
you can do better on this one.” [After the 
fourth story], “I hope you do not mind if I 
am frank, but your stories still are not very 
good. In fact, I would go so far as to say 
that they are about the worst that we have 
had, so let’s see you improve on this one!” 
[After the fifth story], “We have one more 
picture here for you, though I’m afraid you 
are almost hopeless. Anyway, put everything 
you have into this one!” 


As soon as each § had completed his 
sixth story, he was told that he had 
been fed standard comment, was made 
to feel that he had done well, and 
was thanked for his cooperation. Ss 
spoke into a microphone and were 
aware that their stories were being re- 
corded on tape. No names were used 
and full anonymity of results was as- 
sured each S by the Es. 

Aggressive words and depressed 
words and phrases in all TAT stories 
were listed independently by two 
judges, There was a high percentage 
of agreement between judges, and 
where disagreements occurred, a mu- 
tual decision was reached after dis- 
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cussion. Total number of aggressive 
words and total number of depressed 
words and phrases were thus obtained 
for each S’s three TAT stories under 
criticism and no-criticism conditions, 
as well as total number of words in 
each S’s three stories under each con- 
dition. Two judges also scored all 
stories for number of manifest sex 
words (4), number of symbolic sex 
themas (4), and number of optimistic 
words and phrases, the mean for the 
two judges being used as the score. Co- 
efficients of correlation between scores 
by the two judges ranged from .86 to 
.94. Optimistic words and phrases were 
defined for purposes of this experi- 
ment as those falling within one or 
more of the following categories: 


1. Hope: things will, or probably will 
get better or work out. 

2. Faith: prayer, faithful wife, help 
from above, etc. 


3. Kindness: consideration for others 
or for pets; sympathy, support given 
to others. 

4. Patience: only when stated or im- 
plied that patience is a virtue. 


5. Happiness: includes all kinds of 
happiness except illicit sex if clearly 
stated or implied that the illicit sex 
— makes person feel guilty, 
unhappy, or anxious. 


In addition to numerical scores 
which were obtained for each § under 
the two conditions, as outlined above, 
each numerical score was also ex- 
pressed as a proportion by dividing the 
numerical score by the total number of 
words in S’s three stories from which 
the numerical score was obtained. 


RESULTS AND DISCUSSION 


Results for male and female Ss are 
combined, since the Sign-Rank Test of 
differences showed no significant sex 
differences in scores except a signifi- 
cantly larger number of manifest sex 
words for males. Table I reveals that 
story length increased with criticism, 
the difference between the means of 
total numbers of words with and with- 
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TasLe I—Mean Numerical and Proportional Scores for Three TAT Stories 
with and without Criticism 
(N = 19) 


Type of Numerical score 
Score M without M with 
criticism criticism 
Total words.............. 280 432 
Aggressive words..... a2 15.1 
Depressed words...... 153 23.0 
Manifest sex............. 6.0 10.7 
Symbolic sex............ 9.3 17.2 
Optimistic words.... 8.3 17.6 


* Significant at better than .05 level. 
** Significant at better than .01 level. 


out criticism being significant at bet- 
ter than the 1% level. This confirms 
the first hypothesis, that criticism sig- 
nificantly increases length of TAT 
stories. 

The number of aggressive words 
also increased significantly with criti- 
cism, while the proportion of aggres- 
sive words did not (see Table I), 
which confirms the second hypothesis. 
At the same time, each of the four 
other numerical scores, namely, de- 
pressed words, manifest sex words, 
symbolic sex themas, and optimistic 
words, also increased significantly with 
criticism, while none of the propor- 
tional scores did so. From these results 
one might conclude that increase in 
aggressive words with criticism, as well 
as increase in the four other numerical 
scores, is a function of increased length 
of TAT stories with criticism. Further 
analysis of the data, however, reveals 
that this is not true and that hypothe- 
sis 3 is incorrect. 

Table II shows that the only score 


Tas e I1—Correlations between Total 
Words and Five Numerical Scores 
for Three TAT Stories with 
and without Criticism 


(N = 19) 
Without With 
Type of score criticism criticism 

Aggressive words £6** 38 
Depressed words 68** 84** 
Manifest sex 64** 66** 
Symbolic sex 499° S2°° 
Optimistic words 679° 929° 


*Significant at better than .05 level. 
**Significant at better than .01 level. 


Score as a proportion of 
total number of words 


p without p with 
r t criticism criticism t 

.86** 3.90** 

07 225° .033 035 26 
48** 2.78* 055 053 38 
.50* 2.47* 021 025 89 
.68** 5.61** .033 .040 1.45 
61** 3.78** .030 041 2.00 


which does not correlate significantly 
with story length is number of aggres- 
sive words under conditions of criti- 
cism. Further proof that increase in 
aggressive words with criticism is not 
a function of increased story length, as 
increases in the other four scores ap- 
parently are, is seen in Table I. Here 
the correlation between scores with 
and without criticism is significant in 
every case, including total numbers of 
words, — in the case of aggressive 
words, in which case the correlation is 
.07. Both lines of evidence, therefore, 
reveal the same picture and substanti- 
ate the conclusion that, while increases 
in depressed words, manifest sex 
words, symbolic sex themas, and opti- 
mistic words in the present experiment 
are a function of increased length of 
TAT stories with criticism, increase in 
number of aggressive words is actually 
due to projection of increased aggres- 
sion which was experimentally pro- 
duced by criticism. 

Results of the present experiment, 
then, confirm the situational validity 
of the TAT, despite the fact that the 
aa ageage of aggressive words in re- 
ation to total number of words was 
not significantly increased by experi- 
mentally induced aggression. This does 
not, of course, solve the more pressing 
problem of general TAT validity, 
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5 fluctuations, for this increases the 
plausibility of our expectation that the 
test is sensitive to similar enduring 
of characteristics” (13, p. 39) . 

Another conclusion which seems jus- 
t tified on the basis of present results is 
that length of TAT stories is often an 
26 | important factor in their interpreta- 
38 | tion. This is clearly seen in the results 
of the present experiment, where cri- 
ticism is followed by 2 significant in- 
crease, not only in aggressive phantasy, 
but also in depressed, sexual (both 
manifest and symbolic) , and optimis- 
tic phantasy in TAT stories. Most 
striking is the fact that the increase in 
optimistic words is actually more sig- 
nificant than the increase in aggressive 
words (see Table I) . If the important 
factor of story length were ignored, it 
is easy to see how the results of the 
present experiment might be ludic- 
rously interpreted as evidence that cri- 
ticism induces happiness or optimism 
which is projected into TAT phantasy. 
When any type of words, themas, or 
other units (6,7, 8) is used in scoring, 
length of stories becomes a vital factor 
because the longer the story, the better 
its chances of receiving a high score. 
When rating scales are used (9, 11, 17, 
18), frequency as well as intensity of 
affect (or sometimes frequency as an 
index of intensity of affect) is ordi- 
narily used in assigning ratings to 
TAT stories, With this type of scoring, 
too, the longer the story, the better its 
chances of receiving a high rating. Cau- 
tion is therefore urged in applying any 
method of scoring to unusually short 
or unusually long TAT stories. Par- 
ticular care is needed in working with 
abnormal populations, especially when 
their TAT performances are to be 
xperi- compared with those of normals, since 
is does} Dana (6) found that story length for 
essing] five TAT stories significantly differen- 
lidity} tiated a normal (M = 595.8) from a 
TAT] clinical 2 panes of neurotics (M = 
ristics} 431.6) and psychotics (M = 420.1). 
, how,Range of numbers of words also 
o findjshowed a systematic trend, increasing 
trably from 350-1025 for normals, to 225-1195 
porary {or neurotics and to 165-1445 for psy- 
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chotics (6) . This raises the question of 
whether, in a study such as that of 
Gluck (9), wide variations in length 
of stories by 30 psychiatric Ss could ac- 
count for the lack of relationship 
found between amount of hostility in 
six TAT stories and degree of overt 
hostility or aggression under stress. 

It may be that a relatively short time 
limit, such as that used in measuring n 
Achievement (15, 16) and n Affiliation 
(1, 19) , with TAT stories written, will 
be a sufficient control of length for 
group experimental work with normal 
adult Ss (12). McClelland et al. (16) 
feel that they have succeeded in ade- 
quately dividing their male college 
population only into high and low 
achievement groups, or at most, into 
high, middle, and low achievement 
groups (15, p. 406) . In individual pre- 
dictive work, however, especially diag- 
nostic clinical work, and in work with 
other than normal adult populations 
(children, psychotics, neurotics, men- 
tally deficient, etc.) short time limits 
are obviously inappropriate because 
they offer an additional handicap to 
the already difficult task of securing 
productive phantasy. It may be con- 
cluded that in scoring TAT stories, 
some method of taking into account 
the length of stories is desirable, and 
under certain circumstances, is a neces- 
sity. 

SUMMARY 


An experiment was performed to de- 
termine whether increase in number 
of aggressive words in TAT stories fol- 
lowing experimentally produced in- 
creased aggression through criticism is 
due to increased story length. It was 
found that story length did increase 
significantly with criticism, as did five 
other scores: numbers of aggressive, de- 
pressed, manifest sex, and optimistic 
words and symbolic sex themas. When 
these five scores were expressed as pro- 
portions of total words, however, there 
were no significant increases from no- 
criticism to criticism conditions. All 


scores correlated significantly with 
story length except number of aggres- 
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sive words under conditions of criti- 
cism. Each score under no-criticism 
conditions correlated significantly with 
the same score under criticism condi- 
tions except number of aggressive 
words. It was concluded that, while in- 
creases in the other four scores were a 
function of increased story length with 
criticism, increase in number of aggres- 
sive words was due to projection of ex- 
perimentally induced aggression, thus 
substantiating the situational validity 
of the TAT. 
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The Blacky Pictures as Personality Measures 
for Undergraduate Areas of Specialization 
Max G. MAGNUSSEN 


Veterans Administration Hospital, Cincinnati, Ohio 
and University of Kentucky Northern Center, Covington, Kentucky 


The stimulus for this research was a 
study by Teevan (7) that concluded 
that correlations between personality 
and vocation discovered by Roe (5, 6) 
and Knapp and Goodrich (4) for pro- 
fessional groups can also be demon- 
strated during the period of under- 
graduate field of specialization. Tee- 
van used the Blacky Pictures as his 
diagnostic instrument and adminis- 
tered the test in a group setting. He 
mentioned in the article that there 
were two deviations from the standard 
procedure: “ (a) up to 15 seconds long- 
er than the standard two minutes 
were allowed on the spontaneous 
stories; (b) the inquiry questions, 
rather than being flashed on the 
screen one at a time with a time limit 
for each question, were printed at the 
bottom of each page. Ss were instruct- 
ed not to glance at them until told to. 
An observational check and question- 
ing at the end of the session showed 
good observance of these instructions.” 
He failed to specifically report the de- 
viations in clinical procedure of: (1) 
projecting the test stimuli on a screen, 
and (2) the examination being ad- 
ministered in a group. 

The purpose of Teevan’s experi- 
ment was to determine whether or not 
personality factors are correlated with 
choice of a major field in college. 
Three broad groupings of literature, 
social science, and science majors were 
compared on scores derived from the 
Blacky Pictures and several significant 
differences were found between these 
groups. The results were interpreted 
to demonstrate that correlations be- 
tween personality and _ vocational 
groups can be found during the period 
preceding entrance into a profession. 

This study attempted to utilize the 
Blacky Pictures as the personality 


measure in a more orthodox manner 
of administration and to determine if 
the results perviously reported by Tee- 
van could be reproduced. 


PROCEDURE 


The Blacky Pictures (2) were indi- 
vidually administered to sixty male 
students from the University of Ken- 
tucky (Lexington campus and North- 
ern Kentucky Extension Center) . The 
subjects (Ss) ranged in age from 19 to 
23 years. The major fields were divided 
into the same divisions used by Tee- 
van, with twenty Ss from each of the 
academic areas of specialization. Divi- 
sion 1 (categorized as literature) was 
comprised of students (N=20) from 
the English department. Division 2 
(classified as social sciences) contained 
students (N=20) from majors in gov- 
ernment, economics, and history; and 
Division 3 (labeled science specialty) 
was composed of students (N=20) 
majoring in biology, geology, chemis- 
try, and physics. In an effort to obtain 
Ss that were seriously involved in their 
educational field, only students who 
had achieved a B plus grade average 
in the area of concentration were used. 


RESULTS AND DISCUSSION 


The results of the present investiga- 
tion are summarized in Tables I and 
II. The findings are presented in a 
manner that will permit direct com- 
parison with the reported results of 
Teevan (7). Table I presents the sig- 
nificance of the differences between the 
three divisions when Blacky Picture 
disturbance scores are correlated with 
choice of major undergraduate special- 
ty. 
Table II shows the percentages of 
the students in the three divisions that 
gave disturbed and neutral responses 
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Tas_e I—The Relationship Between Blacky Picture Disturbance 
Scores and Choice of Major Field 
Ranking of Division by degree 
of Blacky Disturbance Scores 
Blacky Picture Category Pp Most Middle Least 
Oral Eroticism 05 1 2 3 
Oral Sadism 05 2 1 3 
Anal Expulsiveness NS 
Anal Retentiveness NS 
Oedipal Intensity NS 
Masturbation Guilt NS 
Castration Anxiety NS 
Identification Process NS 
Sibling Rivalry NS 
Guilt Feelings 06 2 1 3 
Ego Ideal NS 
Narcissistic Love Object NS 
Anaclitic Love Object NS 
Code: Division 1. — English; Division 2. — government, history, economics; 
Division 3. — biology, chemistry, physics, geology 
NS=Not significant at .05 level. 
Taste II—Percentage of Each Division Showing Disturbed and 
Neutral Responses to Blacky Picture Dimensions 
Division 
Blacky Picture Dimensions 1 2 3 
Literature Social Sciences Sciences 
Oral Eroticism 
Disturbed 52 43 15 
Neutral 48 57 85 
Oral Sadism 
Disturbed 47 57 ll 
Neutral 53 43 89 
Guilt Feelings 
Disturbed 48 58 28 
Neutral 52 42 72 


to the Blacky Picture dimensions 
which yielded statistically significant 
differences between the groups. 

Table I reveals that the first division 
(English) was the highest of the three 
groups on Oral Eroticism. These find- 
ings corroborate Teevan’s results, but 
with a lower degree (.05 as compared 
to .01 level) of statistical confidence. 
In addition, the findings are again con- 
sistent with Bergler’s (1) observations 
of 36 people with literary abilities un- 
dergoing psychoanalysis and conclu- 
sions that writers are fixated at the oral 
erotic level. These results are also in 
agreement with Boyd’s (3) findings of 
more disturbance at the oral erotic 
level in children that are good readers 
than in slow or retarded readers. 


Therefore, the current data from the 
Blacky Pictures are in accordance with 


the aforementioned investigator's cate- 
gorization of literary persons as being 
essentially passive and dependent. 
The second or social sciences divi- 
sion (government, history, economics) 
was highest on the other category that 
showed significant differences and the 
one category that closely approximated 
the .05 level of statistical significance. 
Oral sadism is purportedly related to 
aggression toward the mother; Guilt 
Feelings, as another dimension, is con- 
tingent on the introjection of parental 
standards. Teevan’s “neat” constella- 
tion of a common element — emphasis 
on the mother, as reflected through 
Oral Sadism, Oedipal Intensity, and 
Anaclitic Love Objects was not found 
in its entirety. Only the Oral Sadism as 
aggression toward the mother (Blacky 
Picture #2) and the Guilt Feelings (.06 
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level of confidence) which might be 
consistently placed into the pattern are 
found to be significant differences. 


Finally, Division 3, or the sciences 
(biology, chemistry, physics, geology) 
has the lowest disturbance score of the 
three divisions on most of the Blacky 
Picture categories. This finding is 
again consistent with Teevan’s results 
and permits a conclusion that under- 
graduate males that migrate toward 
this division are less disturbed in psy- 
chosexual development than those who 
major in the other fields of specializa- 
tion appraised in this experiment. 


For those so inclined, the data pres- 
ent material for interesting clinical 
speculation and “hypothesizing” con- 
cerning personality development for 
various people in specific undergradu- 
ate areas of specialization. The results 
are fairly clear-cut and the present 
author is content to deal entirely with 
the obvious findings. 


SUMMARY AND CONCLUSIONS 


The present study attempted to de- 
termine whether the Blacky Pictures 
were adequate measures to appraise 
personality factors which had previous- 
ly been found to be correlated with 
choice of undergraduate fields of spe- 
cialization in college. The Blacky Pic- 
tures were individually administered 
to three categories of college majors 
consisting of twenty Ss in each —— 
ing. Teevan’s results (7) were repli- 
cated in respect to: 

1. The literature division showing 
higher Blacky Picture disturbance 
scores on Oral Eroticism than the other 
two divisions. 

2. The Social Science division reveal- 
ing higher disturbance scores on Oral 
Sadism and nearly significant differ- 
ences on Guilt Feelings. Teevan’s find- 
ings regarding Oedipal Intensity and 
Anaclitic Love Object were not found 
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to be significantly different in this par- 
ticular sample of sixty University of 
Kentucky students. Teevan had the ad- 
vantage of a larger N (eighty-five 
males) which might account for these 
differences in results. 

3. The Science division exhibiting 
the lowest disturbance scores on prac- 
tically all categories. 

It is concluded that the Blacky Pic- 
tures are fairly adequate measures for 
appraising personality variables as 
these factors are related to undergradu- 
ate field of specialization. In addition, 
Teevan’s results are generally reprodu- 
cible and the current findings seem to 
lend further credence to the view that 
in early stages of academic specializa- 
tion, particular individuals migrate to 
specific college majors because of per- 
sonality factors within themselves. Tee- 
van’s group administration of the 
Blacky Pictures appears to afford bene- 
fits of a savings in time and loses lit- 
tle, if any, information which might 
be gained through individual admin- 
istration of the test. This seems to be 
especially true if only the objective, 
scorable data are being considered. 
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Sexual Identification in Responses to Rorschach Card Ill 


MaArvEN O. NELSON, WILLIAM WOLFSON AND RALPH LOCAscIo 
Middletown State Hospital, Middletown, N.Y. 


A number of writers in the field of 
Rorschach research have suggested 
that responses to Card III of the Ror- 
schach may be used to gain an under- 
standing of the sex identification of 
the subject taking the test. (1, 2, 3, 4) 
The implication of these suggestions 
seems to be that the blot on Card III 
lends itself equally well to the per- 
ception of human forms of either male 
or female sex. 


_ The present study is an attempt to 

test the hypothesis that Card III can 
be perceived equally well as either 
male or female human figures by sub- 
jects taking the test. 


METHOD 


Subjects: The data used in the study 
were obtained: (1) From the files of 
Rorschach protocols obtained from 
159 males and 110 females registered 
at the Middletown State Homeo- 
pathic Hospital as regular patients or 
as referrals for psychiatric examina- 
tion; and (2) from protocols obtained 
from 32 males and 52 females who 
were not hospital patients. This sec- 
ond group was mainly composed of 
persons who had applied for admission 
to the State Hospital School of Nurs- 
ing, employees of the State Hospital, 
and a limited number of persons from 
nearby colleges and communities. An- 
other group (3) was composed of 21 
hospitalized and non-hospitalized per- 
sons who were known to have engaged 
in overt homosexual behavior. All pro- 
tocols used in this study were selected 
from a larger collection of protocols on 
the basis of clear designation of human 
form as a response to Card III. Only 
those giving the human form response 
during the free association period of 
the test were used. The subjects from 
whom the protocols were obtained 
ranged in age from 10 years to 79 


years, with 92 per cent of them be- 
tween the ages of 18 and 50. 

Procedure: The Rorschach proto- 
cols were obtained from the patient 
group as a part of their psychodiag- 
nostic examination. A number of the 
non-patient group were given Ror- 
schach tests as a part of the examina- 
tion requirements for admission to 
Nursing School, and others were 
volunteers tested as a part of a re- 
search project. 

Treatment of data: The subjects 
were divided into the following 

oups: Male non-patients (N=32), 
emale non-patients (N=52), male 
patients (N=159), female patients 
(N=110), and male homosexuals 
(N=21). The chi square technique 
was used to test the significance of the 
distribution of responses within each 
group and to compare the various 
groups in regard to their differences 
in distribution of responses. 


RESULTS AND DISCUSSION 


The frequency distributions of 
responses are shown in Table I, along 
with the percentages as indicated. It 
should be noted that the percentage of 
those giving a “neutral” response 
(that is, those designating the human 
figure without indicating the sex) 
ranged from 28.57% in the case of 
the homosexual males to 50% in the 
case of the females. In all groups, the 
percentage of those designating 
“male” were greater than those desig- 
nating “female”. Among the homo- 
sexual males, however, the responses 
were almost equally divided between 
the “male” and “female” designations. 
The female patients showed the high- 
est percentage of any of the groups in 
the preference for the designation of 


“ma e”’. 
The chi-square analysis of the data 
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TABLE I. Human Figure Responses to Rorschach Card III Designated 
as “Male”, “Female”, or “Neutral’”* by 347 Subjects. 


Group Responses 
“Male” “Female” “Neutral” Total 

% N % N % N 
Male Non-patients 12 37.5 6 18.7 14 43.8 $2 
Female Non-patients 19 36.5 8 15.4 25 48.1 52 
Male Patients 61 38.4 31 19.5 67 42.1 159 
Female Patients 47 42.72 8 73 55 50.0 110 
Homosexual Males 8 38.1 7 33.33 6 28.57 21 


*“Neutral” refers to those human figure responses where sex was not designated. 


Taste II. Chi Square Analysis of “Male” - “Female” Responses Showing the 
Significance of Distributions Within Each Group and 
Differences Between Groups. 


Group Compared with: 
M F MP FP HM 
Males (M) 2.00 
Females (F) .0679 4.16* 
Male Patients (MP) .0004 167 9.78°° 
Female Patients (FP) 3.241 2.545 7.66** 26.981** 
Homosexual Patients (HM) 465 958 656 4.500* 62 


* Significant at 5% level of confidence 
**Significant at 1% level of confidence 


Taste III. Chi Square Analysis of Combined “Male-Female” Responses and 
“Neutral” Responses Showing the Significance of Distributions Within 
Each Group and the Differences Between the Groups. 


Group Compared with: 
M F MP FP HM 

Males (M) 50 
Females (F) 129 076 
Male Patients (MP) 026 1.6016 3.954* 
Female Patients (FP) 610 2.637 1.734 0.00 
Homosexual Males (HM) 1.341 2.233 1.366 3.013 3.841* 
*Significant at the 5% level of confidence. 
is shown in Tables II and III. Exami- “neutral” responses by leaving the 
nation of the data in Table II reveals figures with sex undesignated. 
the following differences at the 5% 2. The homosexual male group gave 
level of confidence or above: sex designated responses more often 


1. Female non-patients tended to than neutral responses. 
designate “male” more often than 


“Comeie” CONCLUSION 
2. Male patients tended to designate The results of this study did not 
“male” more often than “female”. confirm the hypothesis that the Ror- 


3. Female patients showed a strong- schach Card III blot can be perceived 
er tendency to designate “male” than equally well as either male or female 
did any of the male groups. human forms. On the contrary, it 

Examination of the data in Table suggested that, for all groups tested, 
III shows the following differences the blot was more often seen as “male’ 
at the 5% level of confidence or or “neutral” than as “female”. This 


above: was most evident in the female patient 
1. More male patients designated group. 
the sex of the human forms seen on The limited number of subjects in 


Card III than the number who gave the non-patient male and the homo- 
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sexual male groups suggests further 
study of these groups would be 
desirable. 
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The Experimental Induction 
of the Defense Mechanism of Projection’ 





RosBERT ROSENTHAL? 
University of California at Los Angeles 


Since Freud first defined projection 
as “the ego thrusting forth onto the ex- 
ternal world whatever gives rise to 
pain internally” (12 v. 4, p. 78), the 
concept has generalized to such an ex- 
tent that today almost any idiosync- 
ratic perceptual behavior runs the risk 
of being labelled projection. Bellak 
probably perceived this when he wor- 
riedly agreed with Sears that Paes owt 
tion was the most inadequately defined 
term in all psychoanalytic theory (3). 
Actually it would seem that a good 
many writers define projection rather 
carefully and adequately (1, 3, 7, 15, 
30, 31, 32, 35) , but many of us generate 
confusion by our varying usages of the 
term. The process whereby the concept 
of projection has picked up all its sur- 
plus meaning is discussed in more de- 
tail elsewhere (24) . 

Considering the importance of the 
concept, relatively little controlled re- 
search has been carried out on projec- 
tion. Fabian (9) , Rokeach (23) , Sears 
(28), Zucker (36), and Norman and 
Ainsworth (20) defined projection op- 
erationally and except for Rokeach did 
find evidence for the operation of pro- 
jection. Before accepting the conclu- 
sions of these investigators, it seems es- 
sential to review exactly what they did 
to see if we can be satisfied that it was 
indeed projection they were investigat- 
ing. The basic designs of these studies 
were enough alike so that a general 
model may be constructed. A question- 
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* Now at the University of North Dakota. I 
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naire purporting to measure various 
traits is filled out by a group of people 
all of whom know each other. Each 
erson fills out the questionnaire once 
or himself and also once for every 
other person in the group according to 
how he feels that other person would 
fill in the questionnaire. Projection is 
said to occur when a person rates him- 
self low on a trait on which others rate 
him as higher, and rates others higher 
on that trait than they rate themselves. 
Thus projection is defined by a discre 
ancy score. This is an operational defi- 
nition and one cannot question the 
legitimacy of an operational definition 
—it is self-contained. One can, how- 
ever, question the usefulness or appro- 
priateness of any given operational 
definition. In these studies, because of 
the emphasis which the pathological 
features of projection have received, 
most of the traits investigated were un- 
desirable ones. These undesirable 
traits may consciously or unconsciously 
have been denied by a large percent- 
age of the subjects so that an empathic 
person correctly ascribing these traits 
to another would be likely to be ac- 
cused of projection where he actually 
showed ee In this type of study 
everything hinges on knowing whether 
a person “really” has a given trait or 
not. Through ratings or through tests 
this is a difficult question to answer. 
Perhaps a more important difficulty 
of these studies is that they in effect 
end by considering projection as a 
trait of which one can have more or 
less depending on the size of the dis- 
crepancy score. A trait is a static en- 
tity, but projection is widely thought 
to be a process (3, 12, 32). The defi- 
nitions of projection referred to earlier 
imply this in no uncertain terms. Thus 
Freud speaks of “the ego thrusting,” 
Bellak of “the ascription of feelings,” 
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and Symonds of “the reference of im- 
pulses.” These are all active, process 
terms rather than static, trait terms. 
We may thus conclude that as far as 
these controlled observations are con- 
cerned the process of projection has 
been at best touched upon but by no 
means demonstrated to occur. 

We may review briefly those investi- 
gations wherein an attempt was made 
to actually induce projection. Murray’s 
classic study (19) showed that children 
who were frightened perceived photo- 
graphs as more malicious than they did 
before their fear experience. While 
writers on the subject of projection 
have often quoted this study as evi- 
dence for the phenomenon, Murray 
himself considered this phenomenon 
to be “complementary apperceptive 
projection” rather than projection it- 
self. Bellak (3) and Rodnick and Kle- 
banoff (22) found that in response to 
criticism of story-telling and motor 
ability there was an increase in themes 
of aggression on the TAT. They cite 
this as evidence of projection insofar 
as the S’s aggression was “projected” 
onto the TAT cards. Since Bellak de- 
fines projection as “the ascription of 
feelings and qualities of one’s own to 
other people...” (3, p. 354) his find- 
ing might have been more direct and 
telling had he found that criticized Ss 

rceived more criticized figures on the 

AT. Nevertheless, these studies are 
the ones which yield the most sugges- 
tive evidence to date for the existence 
of the process of projection. 

In scrutinizing carefully the litera- 
ture on projection it would seem most 
appropriate to demonstrate the proc- 
ess of projection by inducing it. If we 
can induce it, we not only have evi- 
dence for its occurrence, but we are 
also assured that it will remain a proc- 
ess, in our thinking, instead of being 
viewed as a static trait-like entity. 

It might be well here to restate Sy- 
monds’ definition of projection as the 
“reference of impulses, thoughts, feel- 
ings, and wishes originating in the per- 
son himself to persons and objects in 
the outside world” (32, p. 296). For 
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experimental purposes it is necessary 
to limit this definition somewhat since 
impulses, feelings, wishes, and thoughts 
would be too awkward to investigate 
simultaneously. Evidence for the oc- 
currence of the process of projection 
will be considered to have been ad- 
vanced if known feelings can be in- 
duced in persons who can be made to 
show that they refer these experiment- 
ally induced feelings to others. 

Sears (29, p. 235) has said: ‘“‘Per- 
haps not the most compelling, but cer- 
tainly the most pervasive of all socially 
determined rewards are those leading 
to feelings of success or failure.” Since 
it was felt that this would be especially 
true for college freshmen, it was de- 
cided to choose the feelings of success 
and feelings of failure as those to be 
induced in the experimental Ss. 


METHOD 


Subjects 

One hundred and eight subjects 
were divided into three groups. There 
were thirty-six male paranoid-diag- 
nosed patients from a neuropsychiatric 
hospital, thirty-six male students and 
thirty-six female students all enrolled 
in first year psychology classes. The or- 
dinary objection to highly selective 
sampling due to using volunteers was 
not operative since the patients were 
requested by their physicians to par- 
ticipate and the college Ss were obliged 
to serve time as research Ss as part of 
their course requirements (5). The 
mean age of the patient group was 36.0 
years and their mean Shipley-Hartford 
1.Q. 110.3 (118.3 when only vocabu- 
lary I.Q. was used) . The mean age of 
college men was 20.1 and of college 
women 18.4. Thus while the three 
groups were roughly equated for I.Q. 
the patient group ran about seventeen 
years older than the mean of the col- 
lege Ss. The paranoid group, primar- 
ily paranoid-schizophrenics were se- 
lected by their physicians on the fol- 
lowing criteria: 1) that they be of at 
least average intelligence, 2) that they 
be in sufficient touch with the world to 
be able to understand moderately dif- 








assif 
set ¢ 
afte; 
serv 

y 
runt 
were 
pho 
suce 


pres 








ce 
its 
ite 
»C- 
on 
id- 
in- 


nt- 


er- 
er- 
lly 
ng 
ce 
lly 
de- 
ess 


cts 
ere 
ag- 
TIC 
ind 
led 


ive 
was 
ere 
yar- 
zed 
t of 
The 
6.0 
ord 
bu- 
> of 
ege 
ree 


een 
col- 
nar- 


se- 
fol- 
f at 





hey 
d to 
dif- 





ROBERT ROSENTHAL 


ficult instructions, and 3) that they be 
clearly and typically paranoid. All the 
physicians contributing Ss included 
“projection” as one of their chief cri- 
teria of a paranoid disturbance. Para- 
noids were included since so much of 
the literature links this specific dis- 
order with the defense mechanism of 
projection carried to pathological ex- 
tremes (18, 25, 31) . In addition, if any 
group of psychotic patients, especially 
schizophrenics, are to be included in 
a study with college students, it seems 
most appropriate to use paranoid pa- 
tients since these tend to be more near- 
ly of the same order of intellectual 
functioning. In general, too, paranoids 
are more like normals than are the oth- 
er schizophrenic nosologies (16, 17, 
21). 

College women comprised a separate 
group in view of Schilder’s (27) an- 
alysis of the different meaning of suc- 
cess and failure to men and women in 
our culture. In addition, Zucker’s work 
suggests a sex difference in the project- 
ing process (36) . 


Materials 


The Photo Card Sets. The stimulus 
materials upon which the Ss were to 
project their feelings were photo- 
graphs of men’s and women’s faces cut 
from advertisements in a weekly news 
magazine. They ranged in size from 30 
x 46 mm. to about 42 x 68 mm. and 
were mounted on 3 x 5 inch white 
cards in such a way that most Ss 
thought they were photos taken solely 
for the purposes of this study. Twenty- 
eight of the most ambiguous photo 
cards were assigned to one of two sets 
of fourteen each. The sets were 
“matched” by the method of random 
assignment (8) . Each subject rated one 
set of photo cards before and one set 
after the experimental treatment thus 
serving as his own control. 

The Rating Scale. On a rating scale 
running from —10 to +10 subjects 
were asked to rate the face on each 
photo card according to the degree of 
success or failure they felt was ex- 
pressed by that face. A —10 rating was 
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made when the S felt the face was of a 
person who had just been informed of 
a very severe failure in some important 
unspecified task. A Bag rating was 
made when the S felt the face was of a 
person who had just been informed of 
a very extreme success in some impor- 
tant unspecified task. A —1 rating was 
made when the S felt the face was of a 
person who had just been informed of 
a very mild failure. A +1 rating was 
made when the S felt the face was of 
a person who had just been informed 
of a very mild success. All other num- 
bers occupied intermediate positions 
between —10 and —] if negative and 
between +1 and +10 if positive. 

The “Intelligence Test.” In view of 
the paranoid’s frequent preoccupation 
with intellectual matters and the col- 
lege student’s necessary preoccupation 
with these, it was felt that strong feel- 
ings might easily be aroused in all Ss 
in connection with performance on an 
intelligence test. Two pseudo-intelli- 
gence tests were constructed which re- 
sembled some of Wechsler’s sub-tests 
(24, 34). One of the tests was analo- 
gous to Block Design while the other 
was very roughly analogous to Object 
Assembly. 


Experimental Design 


Experimental Treatment. The over- 
all design of the experiment was to de- 
termine whether the S’s experience of 
either success or failure on the “intelli- 
gence test” influences his ratings of suc- 
cess and failure on the photo cards. 
Each of the three groups of subjects; 
the patients, the college men, and the 
college women, were further subdivid- 
ed into three experimental treatment 
groups: a success group, a failure 
group and a neutral (control) group. 
The failure group was included be- 
cause it is widely felt that unpleasant 
feelings are projected. It is thus the 
primary experimental group which on 
the basis of the literature should show 
the projection process most clearly. 
The success group was included be- 
cause of a few writers’ (28, 32, 33) as- 
sertions that there is a “projection of 
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desirable and good” feelings and im- 
pulses onto other le as well as a 
“projection of the bad” (32, p. 297). 

The success and failure groups were 
read the same initial instructions (24) . 
They were told that the purpose of 
the study was to see if there was any re- 
lationship between “empathy” and 
“intelligence.” Insofar as could be de- 
termined no subject suspected the true 
intent of the experiment. The success 
and failure subjects were then shown 
how to make their ratings of the photo 
cards and told that they would after- 
wards take an intelligence test based 
on “the best intelligence test we use in 
psychology today.” Instructions to the 
neutral group include no mention of 
an intelligence test. Throughout the 
four items which comprised the “in- 
telligence test” the “success” S was al- 
lowed to finish and was warmly com- 
plimented. The “failure” S, however, 
was not permitted to finish any item 
and the E’s remarks became increas- 
ingly solicitous, e.g., “Perhaps we 
should try an easier one.” 

After having undergone the success 
or failure treatment, the S took anoth- 
er “empathy” test wherein he rated the 
other matched set of fourteen photo 
cards. If the hypothesis of projection 
were correct we would expect the fail- 
ure group to see more failure on the 
second rating than on the first. In 
terms of the ratings assigned each 
photo card this would mean that the 
S assigned a lower algebraic number to 
the - se presented after failure. We 
may call this change in rating a shift 
score. The shift score is positive in 
sign if more success is seen after treat- 
ment and negative in sign if more fail- 
use is seen after treatment. If the hy- 
pothesis of the “projection of the 
good” were correct we would expect 
the success group to see more success 
.on the second rating than on the first. 
The neutral S, instead of taking an 
“intelligence test,” was permitted to 
read the school paper between the first 
and second ratings. Presumably the 
neutral group’s second ratings should 
not differ significantly from their first 
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ratings. Before leaving this section it 
must be pointed out that in order to 
avoid any harmful psychological ef- 
fects of the failure Ss without giving 
away the intent of the experiment, an 
extension of the experimental design 
became necessary. All failure Ss were 
given the “success” treatment after 
their second ratings and told that only 
their best performance would “count.” 
Thus, while many failure Ss were quite 
upset by the failure treatment, none of 
them remained disturbed after their 
subsequent success experience. 

Level of Aspiration, Although it was 
felt that success and failure on an in- 
telligence test would represent an emo- 
tionally “real” experience for the Ss, 
the possibility still existed that some or 
all of the Ss would remain uninvolved 
(is. 14) . To this end, all Ss were given 

e Cassell Group Level of Aspiration 
Test (6). Frank and others (2, 6, 10, 
11, 26) point out that ego involvement 
is related to a high level of aspiration 
and that this level of aspiration is 
a fairly global characteristic of a per- 
son. Thus a high aspirer would be ex- 
pected to have a greater negative reac- 
tion to the experience of failure. In 
terms of the experiment, then, even if 
we found no projection by any of the 
treatment groups, we might neverthe- 
less find that those Ss who had a high 
level of aspiration did show projection. 
We would then be in a position to state 
that although projection was not gen- 
erally induced, it was induced in those 
cases where the S was ego-involved and 
cared about his success or failure. This 
in itself would be considered appro- 
priate evidence for the occurrence of 
projection. Accordingly, each of the 
three groups of subjects: the paranoid 

atients, the college men and the col- 
ege women were divided into high 
aspiring Ss and low aspiring Ss. In each 
group the eighteen Ss scoring highest 
on the level of aspiration measure be- 
came the “high aspirers” and the 
eighteen low scorers became the “low 
aspirers.” 

The Factorial Design. To summa- 
rize, three groups of Ss were each sub- 
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divided into three experimental treat- 
ment groups and each of the resulting 
nine groups were then divided into 
high and low aspiration levels. This 
3 x 3 x 2 factorial design is illustrated 
in Figure 1. Since it was not possible to 
be certain whether the two photo sets 
of the “empathy” test were really iden- 
tical, half of the subjects in each of the 
eighteen cubicles rated photo card set 
A first while the other half rated photo 
card set B first. This counterbalancing 
of photo card set presentation was de- 
signed to cancel out any differences due 
solely to the difference in photo card 
sets A and B. This of course expanded 
the factorial design to a 3 x 3x 2x 2. 
It will be remembered that the data 
going into each cubicle are shift scores. 
For example, a before-treatment rating 
of +-2 and an after-treatment rating of 
+6 yields a shift score of +-4. A before- 
treatment rating of —6 and an after- 
treatment rating of —2 also yields a 
shift score of +-4. 


n 
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RESULTS 


The analysis of variance based on 
the shift scores from the first to the 
second “empathy” test indicates wheth- 
er any changes have occurred which 
might be attributed to treatment, to 
group (or nosology) and to the two 
levels of aspiration. The eighteen cubi- 
cles formed by the three groups, three 
treatments and two levels of aspiration 
yield variance estimates based on three 
main effects, three simple interactions 
and one triple interaction. Since each 
cubicle was divided into two semi-cubi- 
cles differing in order of photo card set 
presentation, the within cubicle vari- 
ance was broken down into within 
semi-cubicle variance. There were four 
within cubicle sources of variance: (a) 
“persons within semi-cubicles,” was the 
variance due to individual differences; 
(b) “card order within semi-cubicles” 
was the variance due to the fact that 
the fourteen cards of each photo set 
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Tase I—Analysis of Variance Based 
on Pre-Posttreatment Shift Scores 


Source Mean Significant 
of Variance df Square Fs 
Aspiration 1 0.76 
Treatment 2 3.78 
Groups = 1.01 
Asp. x Treat. 2 32.26 
Asp. x Groups 2 743 
Treat.x Groups 4 16.00 
Asp. x Treat. 
x Groups 4 38.18 
Persons within 
semi-cubicles 72 31.01 
Card order 
within 
semi-cubicles 468 46.48 1.60* 
Interaction 
card order 
x person 936 = .29.17 (Error term) 
Residual 18 197.97 
Total 1511 


* Significant at the .01 level 


differed from each other and from the 
fourteen cards in the other photo set; 
(c) “interaction between persons and 
card order within a semi-cubicle” was 
the variance due to the differing cards 
and card sets’ differential effects on in- 
dividuals. Since this source of variance 
was the variance left after the known 
sources of within cubicle variance were 
removed (persons within semi-cubicles 
and card order within semi-cubicles) , 
it was used as the error term.’ (d) A 
residual variance representing the in- 
teraction between the order of photo 
card presentation and the differing 
semi-cubicles remained as an addition- 
al source of within cubicle variance. 
Table I shows the results of the analy- 
sis. None of the among cubicle sources 
of variance showed a significant F. 
Only the “card order within semi-cubi- 
cles” source of variance was significant. 
Our conclusion thus must be that by 
the experimental design employed 
projection was not demonstrated to 
occur. In addition it must be con- 
cluded that the difference between the 
two photo sets in amount of shift was 
markedly significant. 





* For working out many of the difficulties in 
this rather complex statistical treatment I am 
extremely grateful to Prof. J. A. Gengerelli. 


Experimental Induction of the Defense Mechanism of Projection 


DIscussION 


In this type of personality research, 
there is always the payee that the 
S’s knowledge that he is participating 
in an experiment makes it difficult to 
elicit responses (e.g. projection) which 
are typically thought to occur in “real” 
situations posing a threat to the ego. 
Thus the simple knowledge of his be- 
ing in an experiment may be enough 
to serve as a kind of antidote to experi- 
mentally induced stresses. This con- 
ceivable failure to ego-involve the Ss 
may account for the experimental fail- 
ure to demonstrate the occurrence of 
the projection process. In addition to 
this more general possibility there ap- 
pear to be three others more specific to 
this experiment which might account 
for this negative finding. 

The first of these is that the experi- 
ment may have in some way been in- 
appropriate to the problem at hand. 
Thus it may have been unrealistic at 
this stage of our knowledge in this area 
to attempt a precision of technique 
which may have been inconsistent with 
the precision of our backlog of empiri- 
cal information from which hypotheses 
must be derived. The second possibil- 
ity is that the definition of projection 
must remain, for a time at least, a “sen- 
sitizing concept” as Blumer has used 
this notion (4). That is, the clinician 
knows what he means when he uses the 
term and feels reasonably sure that his 
communicants hold roughly the same 
sensitization to the concept. Under this 
view it would be held that until we 
learn more about projection there is 
little use in trying to define it further. 
This is indeed a possibility but, it is 
felt here, a remote one. It appears more 
likely that by increasing the precision 
of our concepts we are put into a better 
position to gain empirical information 
about them. The first of the possibili- 
ties accounting for this failure to in- 
duce the projection process is a ques- 
tion of experiment while the second is 
a question of conceptual clarification. 
The possibility that these two objec- 
tions may not account for the negative 
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findings leads us to the possibility that 
the process of projection may not occur 
as different writers have formally de- 
fined this concept. It may be then, that 
the reason for this concept’s having 
gathered so much surplus meaning is 
chiefly that we just do not find the 
simple, uncluttered, clearly defined 
process in operation. Thus, in addition 
to our continuing clinical observations 
of the process of projection it would 
seem worth while to continue experi- 
mental attacks upon the problem in 
the hopes that we can clarify the proc- 
ess and perhaps specify the conditions 
under which it is likely to occur. 


SUMMARY 


The pnaee of this experiment was 
to test the hypothesis of projection, op- 
erationally defined, by a direct attempt 
to induce the process. Accordingly, 108 
Ss composed of 36 hospitalized para- 
noid patients, 36 college men, and 36 
college women were further divided 
into three sub-groups receiving success, 
failure, or neutral experience on an 
“intelligence test.” The nine sub- 
groups thus formed were further di- 
vided into high and low aspiring Ss 
within each sub-group, thus yielding 
an 18 cubicle factorial design. Data 
were based on shift or difference scores 
between an S’s pretreatment ratings of 
photos as more or less successful or 
more or less unsuccessful, and _post- 
treatment ratings. It was postulated 
that if the hypothesis of projection as 
defined by Bellak, Symonds, and others 
were tenable, the success group should 
see (rate) more success after treat- 
ment; and the failure group should see 
(rate) more failure after treatment 
when both groups are compared to the 
neutral (control) group. Paranoid 
patients were included among the Ss 
since the literature suggests that they 
are more prone to project. College 
women were included because of data 
suggesting sex differences in projec- 
tion. High aspirers were hypothesized 











to become more ego-involved with an 
intelligence test experience and thus 
perhaps show the process of projection 
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more clearly. The analysis of variance 
showed none of the among cubicle 
sources of variance to be significant. 
Thus there was no support for the hy- 
pothesis of projection as defined by 
Bellak, Symonds and others within the 
data of this experiment. Possible ex- 
planations for this finding were dis- 
cussed. 
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The Integration of Determinants and Content in 
Rorschach Interpretation’ 





David SHAPIRO 
Austen Riggs Center 


‘. 


First of all, I would like to state my 
understanding of the question that is 
presented to us at this symposium on 
“The Relative Importance of Deter- 
minants and Content in Rorschach In- 
terpretation.” I assume that by “im- 
portance,” simple empirical validity is 
not meant. That would be a perfectly 
legitimate question—by which method 
of interpretation one could obtain the 
best predictability in a given situation. 
But I am not sure how much would 
be learned from that question, just be- 
cause I am convinced that no really 
decisive answer could be obtained, that 
the answer would vary depending on 
the variable to be predicted and also 
on the skill and the direction of inter- 
est of the predictors. If the question 
were to be simply concerned with mat- 
ters of everyday, practical importance, 
I think it would be easier to argue for 
the greater importance of determin- 
ants, and for a very simple reason: the 
Rorschach happens to be an instru- 
ment the design of which is directed to- 
ward indicating certain formal ele- 
ments of individual perceptual tenden- 
cies rather than elucidating verbal con- 
tent. This is its unique contribution 
and its historical significance, and al- 
though it is theoretically possible that 
an instrument designed for one pur- 
pose would later turn out to perform, 
with optimal efficiency, another pur- 
pose, this would be essentially an un- 
likely coincidence. 

However, I assume that the question 
is a different one. Namely, is there a 
difference in the importance of these 





‘Originally presented at the symposium on 
“The Relative Importance of Determinants 
and Content in Rorschach Interpretation” 
sponsored by the New York Society for Pro- 
jective Techniques, May 17, 1958. 


two factors which is intrinsic to them 
and to the Rorschach test; are there 
theoretical reasons why, ultimately 
even if not at present, one approach is 
likely to reveal decisively more of what 
we are interested in than the other? 
This is the question that, it seems to 
me, must be answered negatively. I 
hope that my position will not be 
taken for fence-straddling on this mat- 
ter, because my argument essentially is 
for a different point of view, namely 
that each of these—determinant and 
content interpretation — is absolutely 
indispensible to optimal understand- 
ing of the other, and that the question 
of “relative importance” therefore can- 
not be answered one way or another 
for good reasons. What I will present 
to you could hardly be called a ration- 
ale for this position. It is only a frag- 
mentary sketch of the reasons for a 
point of view which, however, I have 
found absolutely necessary in clinical 
work. 


2. 


What we obtain in a Rorschach re- 
sponse or a Rorschach protocol] are es- 
sentially final, external products of in- 
internal processes. It is in the nature of 
such final external products, and espe- 
cially those that reflect, as the Rors- 
chach protocol does, highly complex 
internal processes, that they have in 
one particular sense a certain ambigu- 
ity. I mean by this simply that there is 
no one to one correspondence between 
a given final product and a specific in- 
ner process. This is not to question the 
determinism of such final products, 
such as an expressed thought or a piece 
of behavior, but only to say that the 
product itself does not necessarily re- 
flect the processes that preceded it in 
any unique way. While I think this is 
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one of the central problems of diagnos- 
tic work in general, it is also a com- 
monplace, and it is very easy to illus- 
trate: 

Imagine that you walk along the 
street and hear a man singing to him- 
self a popular, romantic love song. You 
listen carefully to his words; what do 
they tell you? As carefully as you may 
listen, the words will not tell you whet- 
her he is in love or isn’t in love, whet- 
her he is happy or sad, whether he is 
rehearsing a tune for an audition he is 
about to have or whether he really 
means, literally, what he is saying. Any 
of these internal processes and many 
more might have resulted in this ex- 
ternal product, and his words will not 
reveal which. In fact, if one would 
direct attention too closely to the con- 
tent of the song, that very fact might 
well preclude any clear impression at 
all of the man’s actual mood and in- 
terest. 

Consider another example. Suppose 
one is waiting outside a telephone 
booth and notices that the man who is 
using the phone is talking in a very 
excited, rapid, almost hysterical man- 
ner. Perhaps the impression is of some- 
one who is making a call which con- 
cerns some family emergency. But it 
might equally well be someone, with 
quite different feelings, trying hard to 
make a last minute plane reservation, 
or even someone telling a friend about 
a piece of very exciting and happy 
news. It is likely in this case that know- 
ing the exact content of the man’s 
speech would establish one or another 
of those possibilities as the actual fact. 
Such knowledge, of course, would then 
allow a fresh and more insightful ex- 
amination of the man’s excited man- 
ner. 

It seems to me that this fact that a 
given, final external product and in 
particular a single dimension of such a 
product need not necessarily reveal in 
a unique way the internal processes 
that precede and cause it is the gen- 
eral fact that argues against weighing 
the importance of one dimension of 
the Rorschach against another. It 
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seems that the Rorschach is such an 
excellent instrument just because it 
provides a number of dimensions, of 
which, incidentally, the determinants 
and the ideational content are only 
two of the major ones. 


3. 


I would like to make some points 
now concerning the nature of the 
Rorschach response and the processes 
behind it. 

The final Rorschach image, the fully 
conscious and externalized or ex- 
pressed Rorschach response is, in a way 
of speaking, not just the product of 
associative and perceptual operations 
or even of the interplay between them. 
That is, it is not merely a result. It is 
the terminal point of a process, but it is 
in itself, an integrative operation, The 
nature of, and the success or failure of, 
this integration is as we all know one 
critical factor in any interpretation. It 
is so clear that the integration of idea- 
tional content and the visual-percept- 
ual factors is the task in the Rorschach 
and that the task is neither simply per- 
ceptual nor simply associative, that 
when this integration fails, for exam- 
ple, in failure of sufficient associative 
distance from or elaboration of the vis- 
ual impression, as in color-naming, or 
in an incapacity to bind associative 
trends sufficiently to visual factors, we 
do not hesitate to consider this a re- 
flection of serious pathology, or, in 
some cases, we may rule this out alto- 
gether as a response proper. 

Nor do I believe it can be said, as 
I understand the so-called projective 
hypothesis to say, that the integration 
which constitutes the Rorschach task 
is only a compromise between associa- 
tive trends and external visual require- 
ments. Here again, I think that if in a 
Rorschach protocol one actually gets 
an impression of such a compromise 
between internal associative pressures 
and external, visual requirements, one 
marks it down as a less than adequate 
integration, as a response of less than 
good form-level, for example, and con- 
siders it to that extent a failure. 
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The visual qualities of the ink blots 
are not ordinarily a restriction on asso- 
ciative processes and a factor with 
which one compromises. The ink blot 
qualities are, within that situation, the 
vehicle of associative processes, their 
medium, and the perceptual organiz- 
ing tendencies that are brought to bear 
on these visual qualities may be com- 
pared with a syntax without which 
these associative processes cannot be 
expressed. As with any expressive med- 
ium or syntax, this one — the visual 
qualities available together with the 
organizing tendencies brought into 
play—shapes and lends its stamp to the 
content which is expressed through it.? 
Thus, one cannot view Rorschach im- 
agery as if, for example, it were dream 
imagery any more than one can view 
dream imagery as if it were waking 
daydream imagery, or that as if it were 
ordinary day to day discourse or con- 
versation. It is in the nature of human 
equipment to see things, to organize 
visual experience, to coordinate it with 
memory, interest, knowledge and such, 
and the ideational content that ap- 
pears in such circumstances cannot be 
most profitably viewed except as co- 
ordinated with that actual visual ex- 
perience. Those who work intensively 
with the Rorschach appreciate this 
fact intuitively. It is more or less built 
into many aspects of our approach to 
the Rorschach protocol, and it is built 
into our recognition as pathological of 
any incapacity to satisfy the ordinary 
standards of such coordination be- 
tween visual and ideational factors. 
Naturally, this is not to say that idea- 
tional content or mode of expression 
in the Rorschach situation is unre- 
lated to ideation which appears in 
other contexts, that a great deal may 
not be learned about one from the 
study of another, or that certain prin- 
ciples or individual modes of organi- 
zation may not be common to all. 

The Rorschach, in addition to its 
ideational aspect, allows for a high de- 


*For a demonstration of this point, Werner 


and Kaplan’s (1) work on expression in line 
drawings is valuable. 
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gree of individual inclination or per- 
sonal style in perceptual organization. 
We know of course that the individual 
perceptual style which we see in a Ror- 
schach protcool, or rather than speak 
of a single perceptual style, the or- 
ganization of a variety of styles and 
tendencies which may express them- 
selves in a protocol, reflects broader 
modes of expression and of character 
organization. The recent experiment- 
al work in individual perceptual and 
cognitive styles, for example of Klein 
and his associates, is after all in its 
outlook not new to us. For us, the for- 
mal characteristics of the Rorschach, 
including particularly the determin- 
ants, are the data with which we arrive 
at an understanding of individual per- 
ceptual style and organization. ‘The 
determinants have this meaning or 
none at all. 

It could be’argued with some justi- 
fication that the determinants have not 
always clearly and explicitly had this 
meaning in the past. The various det- 
erminants have been attached, with a 
greater or lesser degree of empirical 
validity, to a hodgepodge of human 
traits which seem impossible to sys- 
tematize. Color attached to affect as 
though the color itself actually con- 
tained some affect deposit and with in- 
sufficient understanding of the percep- 
tual processes through which such a 
relationship may, or may not, be estab- 
lished; shading attached to anxiety; 
FM to impulses of some sort; M to 
empathy; and so on. In spite of this 
state of affairs, we still must assume 
that the determinants fundamentally 
have their meaning as reflecting formal 
aspects of perceptual style and organi- 
zation, and that their further inter- 
pretive or indicator-meaning must ul- 
timately be understood in that sense. 

Just as content gets its sense and its 
tone through the medium and style of 
its expression, so the expressive style is 
unthinkable without content. It is, 
again, the whole sense and direction of 
visual and perceptual equipment to see 
things, and the nature of the things 
which are seen, the quality of the 
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things that are of interest, in other 
words, the visual content, will give the 
perceptual style its final and definite 
shape. To use an analogy which is not 
too distant from our interest, one can- 
not be simply compulsive, or depres- 
sive, or repressive. Compulsiveness can 
exist only with a content of action or 
ideas, and the nature of these actions 
and ideas which means, also, the nat- 
ure of the material available for them, 
will determine the final form of com- 
pulsive tendency. 


4. 


My aim is not to convince you that 
you should rely just equally on con- 
tent and on determinant interpreta- 
tion in Rorschach work. It seems to me 
that perfectly legitimate differences of 
interest, natural inclination or back- 
ground will tend to cause each one to 
give more weight to one side than to 
the other. Also, there is no doubt that 
there are differences in this respect 
among different Rorschach protocols; 
some, quite legitimately I think, seem 
to call for more emphasis on content 
interpretation and others on determin- 
ant interpretation. The aim here is, 
rather, to-show that there is no reason 
intrinsic to the test to consider one as- 
pect more important than the other, 
and, in addition, to show that each 
aspect can be properly understood only 
in the light of the other. 

I would like to illustrate this, not 
exhaustively, but with certain prob- 
lems which I think are common or 
typical ones. I will take two sorts of 
problems for this purpose: 1) the prob- 
lem of levels or the problem of refer- 
ence in content interpretation; and 2) 
the problem of specificity in interpre- 
tation which relies on determinants 
primarily. 

Let me give an example in order to 
clarify the meaning of this problem of 
levels or problem of reference in con- 
tent interpretation. We are all familiar 
with certain Rorschachs in which there 
is a steady current of sado-masochistic 
imagery. This imagery, for instance, 
may include such themes as tearing, 


cutting, beating, kneeling, or other 
themes suggestive of sadistic action, 
humiliation, or the like. There are sev- 
eral interesting problems in connec- 
tion with the interpretation of a con- 
tent theme such as this, but for the 
time being we will consider only one. 
The action-mode suggested by this con- 
tent, namely sadism, has a certain am- 
biguity, and it retains this ambiguity 
no matter how precisely, by tracing 
the content theme throughout the pro- 
tocol, one is able to delineate its de- 
tails. The ambiguity may be described 
in several ways. It may be described as 
an ambiguity as to subject and object 
—on which end of the sadistic action 
does the person who gives the response 
feel himself to be? Or one can put the 
same question in a different way and 
ask, is the action with which the re- 
sponse-giver is primarily identified a 
sadistic or a masochistic action? 

It might be said that this supposed 
ambiguity is only apparent, and that a 
great deal of clinical experience as well 
as certain theoretical considerations 
make it very clear that the apparent 
ambiguity is only an expression of the 
fact that these two positions—subject 
and object, or these two interests, sad- 
istic and masochistic—are always and 
without fail present together within an 
individual. Essentially this is true, 
but it only sharpens the problem and 
makes it more clearly a problem of 
levels, for the fact remains that there 
are individuals for whom the sadistic 
mode is by far closer to conscious at- 
titude and behavior, others for whom 
the masochistic mode is closer to con- 
scious attitude and behavior, and per- 
haps a third group who may in fact be 
characterized primarily by vacillation 
in attitude and behavior between the 
two positions depending on specific 
circumstances. The question is, how do 
we choose among these possibilities on 
the basis of Rorschach content. 

The crux of this problem of levels, 
you will notice, is this: the imagery 
content, the fantasy or memory or in- 
terest elements that are utilized in the 
construction of the Rorschach image 
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or the response content are, or at least 
may be, exactly the same in all these 
cases. What one does not know, and 
what one may very well have, from the 
content dimension alone, a difficult 
time finding out, is the exact relation- 
ship of this imagery to the person who 
gave it, its sense and meaning to him, 
or the mood and attitudes from which 
it emerges. Thus, the meaning and the 
value of this content interpretation re- 
mains seriously limited, in the illustra- 
tion, to speaking only of a general 
sado-masochistic mode, until one can 
answer these questions. 

Where will the answer come from? 
It will come most often, I think, from 
an understanding of the formal quali- 
ties of perceptual organization that 
appear in the same Rorschach. The 
way in which the visual material is 
shaped and handled is another aspect 
or dimension of the responses whose 
imagery content one has been examin- 
ing, and this other dimension will or- 
dinarily give clues to the answers one 
needs, that is, to the prevailing atti- 
tudes and orientation in the context of 
which the imagery has its full mean- 
ing. 

I cannot present here a case inter- 
pretation, but I would like to describe 
what I mean more specifically. The 
same content imagery—cutting, beat- 
ing, etc.—will sound different and will 
mean something different in a Ror- 
schach with a large number of re- 
sponses, with a great predominance of 
pure form responses of excellent form 
level, with little or no color or only 
color that is decisively subordinated to 
form, different in such a record from 
what it will mean in a record of few 
responses, of proportionately fewer 
pure form responses and those of a 
gross, more careless quality, with a 
large number of color-dominated re- 
sponses and the immediate and pas- 
sive relationship to the stimulus that 
those tend to imply. What is being 
suggested here, of course, is an active, 
forceful, penetrating perceptual style, 
on the one hand, and a more passive, 
giving-one’s-self-over to the visual ma- 
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terial on the other. The same imagery 
content may appear in these two wide- 
ly different contexts, but it will have 
different meaning in each. 

This is only one of a large number of 
such problems of level. Phobic and 
counter-phobic issues contain some- 
what similar problems. Broader dimen- 
sions such as masculinity-femininity 
and aggression-passivity will also need 
to be considered in the same light. 

Take another type of problem that, 
again, for the sake of convenience, may 
be called a problem of levels. A 35- 
year-old man gave the following re- 
sponse to the usual area of card VII: 

“They look like two old, gossipy women. 
I noticed the dark shading (in their faces)— 
it looks like somebody has thrown blueberry 
pies in their faces, like in the old slapstick 
comedies.” 

It does not take much ingenuity to 
detect an aggressive aspect to this re- 
sponse. And in fact, this man’s Ror- 
schach contained a number of other re- 
sponses similarly marked by aggres- 
sive imagery. We will assume now, 
which is quite justified in the circum- 
stances, that these responses reflect an 
active and aggressive interest more 
than a subjective feeling of being pas- 
sively aggressed upon. Assuming this, 
what do we learn from this sort of con- 
tent and what do we not learn? It is 
again a fact of the type that I suggest- 
ed in my analogies at the beginning 
that identical aggressive imagery, of 
this sort for instance, may be given on 
the one hand by individuals who 
would not hesitate a moment to act 
aggressively, physically, and on the 
other hand by individuals who would 
never seriously consider such action. 
Not that these latter individuals are 
necessarily sweet Pollyannas—if that 
were the case, they probably would not 
give much imagery of this sort in the 
first place. But there are individuals, 
for example those characterized by 
what is called isolation of affect, whose 
predominant mode is in fact an active 
and in a sense aggressive one, and who 
are quite capable of entertaining on a 
certain level a good deal of aggressive 
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imagery, but whose attitude toward 
that imagery is quite detached. This 
man, for example, was a lawyer, who 
was not tested as a patient and in 
whom there was no outstanding evi- 
dence of pathology. The response con- 
tent was, as far as he was concerned, a 
little joke. The formal Rorschach pic- 
ture, the determinants, in this case 
were quite typical of compulsive peo- 
ple; the form level was impeccable. It 
is true that this sort of imagery reflect- 
ed a somewhat lofty and condescend- 
ing, possibly teasing attitude toward 
women on the part of the subject. But 
that is quite a different story from raw 
aggression. One must conclude that 
content elements such as these which 
may in themselves be identical would 
mean very different things in this 
man’s Rorschach and in another char- 
acterized, for instance, by one or two 

ure C’s, unsteady form level, and 
such. The content elements would 
have a different setting, would be 
drawn from a different level, so to 
speak, in the organization of thoughts 
or fantasies or memories. 

I am not at all sure that the prob- 
lems which might be called problems 
of content reference are essentially 
very different from the problems I have 
just indicated, These problems of con- 
tent reference revolve around several 
facts, the first of which is that many 
images, though not necessarily ambig- 
uous or esoteric, may have such a vari- 
ety of connotations that the interpre- 
ter, unless he is satisfied to be arbitrary, 
is mystified about their specific conno- 
tation on a given occasion. 

Take the simple response “waiter”’ 
to card III. Is this an image with oral 
connotation, or with a connotation of 
servility and status, or one of formality 
and restraint, or some other? It is true 
that many times these questions can be 
answered by study of other content or 
of general content themes. But not 
always. 

For example, there may be a general 
content theme around images.of flow- 
ers and exotic decorations; does this 
have a connotation of artistic taste and 
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sensitivity, or one of compulsive and 
precious pseudo-esthetic interest, or 
one simply of sensuousness? The for- 
mal qualities of the perceptual style 
may, again, give some clue as to the 
actual connotation for the subject or 
may at least limit the range of possi- 
bilities, and therefore contribute to the 
content interpretation which is most 
valid. 

Or consider another example, this 
time a young man’s Rorschach in 
which there was great emphasis on im- 
ages of food, more so than I recall hav- 
ing ever seen elsewhere. One might 
think that the connotation of food is 
clear enough. Perhaps in a camp of 
hungry or half-starved people it would 
be. But in our milieu, though it can 
have connotations of oral or passive 
craving, it can also have connotations 
of stubborn negativism, or sensuous in- 
dulgence, or many others. This record, 
formally, was a poorly controlled one, 
rather wild and reckless in form and in 
color-use, but at the same time with 
many responses that bespoke a kind of 
narcissistic self-confidence through all 
the reckless and impulsive judgments. 
I would say that the connotation of 
food, to this subject, was much more 
one of indulgence, a spoiled-child’s 
tyrannizing demandingness, than the 
depressive or lonely cravings it may 
sometimes suggest. 

The point may be summarized by 
saying that the real meaning of con- 
tent is largely dependent on the kind 
of thought organization from which it 
is drawn, the subjective attitude to- 
wards it, and the personal connota- 
tions which it may have. But the es- 
‘ttablishment of these facts, and there- 
fore the most adequate content inter- 
pretation, will usually require an un- 
derstanding of the quality of the per- 
a organizing tendencies through 
which this content has gained expres- 
sion. 


5. 


What about the use of content for 
determinant interpretation? I think 
that here it is necessary to distinguish 
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two types of problems, one involving 
practical problems which derive from 
the limitations of our present knowl- 
edge techniques, and the other involv- 
ing problems more intrinsic to our 
material. 

In speaking of the first type of prob- 
lem, I have in mind the types and 
categories in which we classify or dif- 
ferentiate the formal qualities of per- 
— in other words, our scores. 
These simply are too gross for our 
needs, and the various traits which 
they are said to reflect, even if in that 
respect they are perfectly accurate, are 
likewise not nearly specific enough. 
High form level, low form level, pure 
color, form-dominated color or, con- 
trol, affect, impulsiveness, and such— 
these dimensions, as significant as they 
are, are simply in themselves insuffi- 
ciently specific. At least they are in- 
sufficient if one wants to go beyond 
more or less gross diagnostic evalua- 
tions in test work. 

But there is a more important and 
perhaps more interesting issue in- 
volved, namely, is it in the nature of 
purely formal modes, such as the per- 
ceptual modes which are represented 
for our purposes here by the deter- 
minants, ever to have the degree and 
kind of specificity which we require in 
our work? Or to put the question an- 
other way, can we look forward to the 
day when purely formal interpretation 
will be sufficiently specific. My own 
belief is that we cannot. I have already 
suggested what seems to me the essen- 
tial reason for this; that is, that, limit- 
ing ourselves for the present to speak- 
ing of formal modes of perception, 
these modes come into being or are 
manifest only with some sort of idea- 
tional content, and it is in fact this 
content which crystallizes their final 
and specific shape. 

What does this mean concretely? We 
can find some illustrations in connec- 
tion with the use of the color-deter- 
minant. 

Consider three color responses, or, 
let us assume, pure color responses: 
blood, spilled (colored) ink, and urine 


os 
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stains. If these three responses are all 
scored pure C, then by definition with- 
in our present scheme of describing 
these formal qualities of perception, 
the three responses are not distinguish- 
able from one another. It is not that I 
doubt that they are different; I am sure 
that they are different, not only in con- 
tent but in the formal quality of the 
perception. But these are differences 
which may very well not be detectable, 
and in this particular case are certain- 
ly not detectable, on the basis of the 
formal score alone. Why should this be 
so? Essentially, I believe, the answer is 
not in an insufficiency of our scorin 
systems, but rather because in the fina 
moment, when the Ps sews tend- 
ency is integrated with ideational con- 
tent, it receives its essentially unique 

uality and stamp which may, to one 
Sunes or another, be special to that 
moment and fo that content. And the 

uality and tone of the content is the 
anise clue to the final shape which 
the broad perceptual tendency has at 
that moment taken. 

In my experience, the “blood” re- 
sponse is most typically found in hys- 
terical people where it seems to have 
the connotation of high pressure of af- 
fect, the “spilled ink” response is found 
often in impulsive, narcissistic charac- 
ter disorders who, as far as affect in the 
ordinary sense is concerned, tend to be 
rather cool and even bland, and the 
“urine stain” in chronic schizophrenics 
where the same formal score and broad 
perceptual mode reflects a severe and 
pervasive blandness. I have very little 
doubt, myself, that the color at that 
moment of response looks different to 
subjects in each of these three cate- 
gories. But the key to how it looks, to 
the exact sense and quality of the per- 
ceptual mode at that moment, will not 
necessarily come from the formal score. 
Nor would it solve the problem funda- 
mentally if we had finer, more highly 
differentiated scores for these re- 
sponses. That might be helpful, but no 
imaginable number of divisions could 
approach the number of specific qualli- 
ties and tones which the broad per- 
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ceptual mode is capable of taking in 
final expression. Ultimately the best 
clue to how the visual stimuli are per- 
ceived at the moment of response will 
come from the specific content of the 
response. 

Any number of other examples, and 
many striking ones, are easy to come 
by among color responses. How else, if 
not by the whole tone and sense of 
the content, could one get a clue to 
the vivid, compelling sensory quality 
of a response such as this one given by 
an acutely paranoid psychotic boy to 
the center red area of card III: 

“Like bright headlights on a car—like when 
you’re on a beach at night and a car points its 
headlights at you.” 

Or the different quality of vividness 
in a hypomanic woman’s response to 
all of card IX: “An orgasm in color.” 

Determinants — major categories of 
formal perceptual qualities — tell us 
something in these cases. But the fact 
is, again, that it is simply not in the 
nature of these categories to delineate 
with sufficient sharpness the visual and 
perceptual operations which can be of 
value to us in understanding the spe- 
cific condition of the subject. 

Examples could, of course, be multi- 
plied with other determinants. In con- 
nection with the form determinant, for 
example, we already include in our 
scoring one attempt to specify the per- 
ceptual tendency more sharply by ref- 
erence to the adequacy of integration 
of visual and content factors. I am re- 
ferring to form level scoring. And ad- 
ditional gains in sharper specification 
are made in scoring, for example, 
“vague form” separately. The advant- 
ages of such finer form scoring are par- 
allel to the advantages of scoring 
“deterioration color” separately, and 
all such attempts necessarily consider 
aspects of content or integration of 
content and visual quality in their at- 
tempts to delineate the formal percep- 
tual style more finely. These distinc- 
tions are always valuable, and there 
will be room for finer scoring differ- 
entiations of this sort up to the point 
where the scoring system will be too 


cumbersome. But at the same time, it 
will be well to recognize the funda- 
mental quality of these problems and 
to recognize that adding scoring differ- 
entiations here and there will never 
enable us to obtain as adequate a view 
of the special perceptual tendency at 
the moment of response as will, often, 
the content of the specific image that 
has become the final and externalized 
shape of that tendency. 


6. 


I am well aware that I am not say- 
ing anything new to experienced Ror- 
schach workers. I believe in fact that 
the method and approach that I have 
been describing is one of the opera- 
tions implied in the term “clinical in- 
tuition” as applied to Rorschach inter- 
pretation. But it is of value to make 
such operations explicit as far as we 
can. For one thing, they can throw ad- 
ditional light on such a question as 
the one we are discussing today; per- 
haps secondly, they may lay to rest 
some of the experimental research 
methods which tally up contents or 
determinants as though each and every 
time these had the same subjective 
meaning; and finally, they may sug- 
gest a more rigorous clinical applica- 
tion of this approach. I think that it 
may be just because the content dimen- 
sion used in the way I have described 
is sometimes overlooked, that formal, 
determinant interpretation tends to be 
too gross in its descriptions and tends 
to lack a sharpness that gives a sense 
of conviction that a real human being 
is described. On the other hand, I ex- 

ct that it is inadequate use of the 
ormal, or determinant, dimension 
that causes content interpretation to 
be, in my opinion, almost always sharp- 
ly detailed, but often incorrect. 

I would like to make one final point. 
When we work actively with at least 
two dimensions of a production in this 
way, there is a special additional gain: 
we have a built-in criterion of ade- 

uacy, From this point of view, any 
orschach interpretation is less than 
adequate if it does not succeed in an 








i 
c 
c 
t 
V 
a 
c 





r- 
at 
ve 


n- 
r- 


d- 


al, 


ads 
nse 
ing 
ex- 
the 
ion 
to 


irp- 


int. 
east 
this 
1in: 
ade- 
any 
han 


1 an 








Davin SHAPIRO 


integration of every formal and every 
content aspect, and as long as one is not 
clear in the light of the other, we know 
that there is something missing. I 
would not say that this approach guar- 
antees correctness, but I think it makes 
correctness much more likely than any 
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other method. 
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ceptual mode is capable of taking in 
final expression. Ultimately the best 
clue to how the visual stimuli are per- 
ceived at the moment of response will 
come from the specific content of the 
response, 

Any number of other examples, and 
many striking ones, are easy to come 
by among color responses. How else, if 
not by the whole tone and sense of 
the content, could one get a clue to 
the vivid, compelling sensory quality 
of a response such as this one given by 
an acutely paranoid psychotic boy to 
the center red area of card III: 

“Like bright headlights on a car—like when 
you're on a beach at night and a car points its 
headlights at you.” 

Or the different quality of vividness 
in a hypomanic woman’s response to 
all of card IX: “An orgasm in color.” 

Determinants — major categories of 
formal perceptual qualities — tell us 
something in these cases. But the fact 
is, again, that it is simply not in the 
nature of these categories to delineate 
with sufficient sharpness the visual and 
perceptual operations which can be of 
value to us in understanding the spe- 
cific condition of the subject. 

Examples could, of course, be multi- 
plied with other determinants. In con- 
nection with the form determinant, for 
example, we already include in our 
scoring one attempt to specify the per- 
ceptual tendency more sharply by ref- 
erence to the adequacy of integration 
of visual and content factors. I am re- 
ferring to form level scoring. And ad- 
ditional gains in sharper specification 
are made in scoring, for example, 
“vague form” separately. The advant- 
ages of such finer form scoring are par- 
allel to the advantages of scoring 
“deterioration color’ separately, and 
all such attempts necessarily consider 
aspects of content or integration of 
content and visual quality in their at- 
tempts to delineate the formal percep- 
tual style more finely. These distinc- 
tions are always valuable, and there 
will be room for finer scoring differ- 
entiations of this sort up to the point 
where the scoring system will be too 





cumbersome. But at the same time, it 
will be well to recognize the funda- 
mental quality of these problems and 
to recognize that adding scoring differ- 
entiations here and there will never 
enable us to obtain as adequate a view 
of the special perceptual tendency at 
the moment of response as will, often, 
the content of the specific image that 
has become the final and externalized 
shape of that tendency. 


6. 


I am well aware that I am not say- 
ing anything new to experienced Ror- 
schach workers. I believe in fact that 
the method and approach that I have 
been describing is one of the opera- 
tions implied in the term “clinical in- 
tuition” as applied to Rorschach inter- 
pretation. But it is of value to make 
such operations explicit as far as we 
can. For one thing, they can throw ad- 
ditional light on such a question as 
the one we are discussing today; per- 
haps secondly, they may lay to rest 
some of the experimental research 
methods which tally up contents or 
determinants as though each and every 
time these had the same subjective 
meaning; and finally, they may sug- 
gest a more rigorous clinical applica- 
tion of this approach. I think that it 
may be just because the content dimen- 
sion used in the way I have described 
is sometimes overlooked, that formal, 
determinant interpretation tends to be 
too gross in its descriptions and tends 
to lack a sharpness that gives a sense 
of conviction that a real human being 
is described. On the other hand, I ex- 
nee that it is inadequate use of the 
ormal, or determinant, dimension 
that causes content interpretation to 
be, in my opinion, almost always sharp- 
ly detailed, but often incorrect. 

I would like to make one final point. 
When we work actively with at least 
two dimensions of a production in this 
way, there is a special additional gain: 
we have a built-in criterion of ade- 

uacy. From this point of view, any 
orschach interpretation is less than 
adequate if it does not succeed in an 
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integration of every formal and every 
content aspect, and as long as one is not 
clear in the light of the other, we know 
that there is something missing. I 
would not say that this approach guar- 
antees correctness, but I think it makes 
correctness much more likely than any 


other method. 
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Mira Y Lopez, Emilio, Myokinetic Psy- 
chodiagnosis, New York: Logos Press. 
1958. (Translated from the French by 
Mrs. Jacques Dubois.) 


This book, originally published in French 
in 1951 has been translated under the editor- 
ship of Leopold Bellak, Michael H. P. Finn, 
Leonard Small and Frances Bishop. It is a 
manual for the use of the Myokinetic Psycho- 
diagnostic technique which is an unstructured 
graphomotor approach to the investigation of 
personality characteristics. Six sets of simple 
line figures are to be reproduced by the sub- 
ject under varying conditions. Both hands are 
used for the drawings. For the main portions 
of his responses the subject is not permitted to 
see the movements he is making but must 
depend on kinesthesis. A special test booklet 
and a special table permitting controlled tilt- 
ing are required. 

The test is based on the now familiar con- 
cept that bodily movements reflect personality 
characteristics. The use of both hands provides 
an opportunity to make comparisons based on 
the belief that “The dominant hand is ex- 
pected to show more ego expression in that it 
exercises more practical control; the weaker 
hand, lacking the same degree of voluntary 
corrective control, is looked upon for more in- 
stinctive unconscious expressions.” There are 
brief comments about the interpretation of 
the graphic responses. These are based on 
linear and chronological measurement as well 
as some special pattern variations. Intelligence, 
depression and euphoria, intrapsychic coher- 
ence (integratedness) excitation and inhibi- 
tion, anxiety and apathy, aggressiveness, intro- 
version and extroversion are among the areas 
dealt with. There are also suggestions for use 
of the instrument for diagnostic differentia- 
tion. Some normative tables are provided 
based on populations other than those of the 
United States. Some statistical evidence of reli- 
ability and validity are included. 

As the editors point out, in its present form 
the technique is too cumbersome for the prac- 
ticing psychologist but certainly provides an 
intriguing opportunity for research. There has 
been a great deal of research outside this coun- 

try but little here. Although there are other 
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graphomotor approaches such as that by Gehl 
and Kutash, Rapkin, and others, Mira’s or- 
ganized approach, using both hands for the 
comparative purposes he does, is a distinctive 
one that merits further consideration by psy- 
chology in the United States. However, the 
instructions both for administration and in- 
terpretation are extremely meagre and at 
many points difficult to follow. At this time, as 
implied by the editors, the translation of the 
manual has its greatest value as a stimulant 
to those who would do research in the area of 
graphomotor expression. 


MortTIMER M. MEYER 
Los Angeles 


W. J. Revers, Der Thematische Ap- 
perzeptionstest (TAT). Hans Huber; 
Bern and Stuttgart, 1958. 209 pp. 


This is an elementary text on the TAT by 
a German psychologist. After the introduction, 
there is a description of the test, application 
and administration, including a _ supple- 
mentary interview and a discussion of various 
methods of diagnosis using the TAT. He con- 
cludes with a section on unresolved problems, 
and one on the psychocathartic side-effects 
(Nebenwirkung) of the TAT. 

For American psychologists the book is of 
interest solely for comparative purposes. It 
contains no new material, and recommends 
some procedures, such as using the TAT to 
stimulate psychotherapeutic discussions, 
which most psychologists here would consider 
to be at least of questionable value. 

Almost all the literature cited is from Amer- 
ican sources. However, he has relied mainly 
on published books and relatively little on 
journal articles. 

Inasmuch as a number of stories and pro- 
tocols are cited from German sources, these 
could be used for comparison with stories 
from other countries by anyone interested in 
the “culture-bound” aspects of the TAT. 

The book is Vol. 2 in the Encyclopedia of 
Psychology in Individual Presentations, a 
Hans Huber series. Vol. 1 is a translation of 
Gordon Allport’s “‘Becoming.” 
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GENERAL NEWSLETTER 


Alvarez Villar, A. & Narros Martin, 
G. La percepcion visual en las diversas 
entidades nosologicas psiquiatricas. 
Revista de Psicologia general y apli- 
cada, 1958, 13, 739-766. 


A perceptual test, the Progressive Matrices 
and the Rorschach were administered to sev- 
eral diagnostic groups. Empirical findings and 
theoretical points are presented. 


Schachter, M. Psicodiagnostico, con 
el test de Rorschach, del caracter opo- 
sicionista e inestable. Archivos de 
Criminologia, Neuro-Psiquiatria y Dis- 
ciplinas Conexas, 1959, 7, 14-21. 


The author describes two Rorschach vari- 
ables which he finds characteristic of the pro- 
tocols of unstable and oppositional personal- 
ities. 


Porteous, Hedy S. Enuresis: the 
symptom and the character structure, 
Its relationship to the oedipal conflict. 
A Crianca Portuguesa, 1958, 17, 665- 
681. 


The psychoanalytic theory of enuresis is 
documented with a discussion of drawings. 


Dabrowski, K. La desintegration men- 
tale comme facteur positif dans le de- 
veloppement emotionel de l’enfant. A 
Crainca Portuguesa, 1958, 17, 731-736. 

Indications of positive features among signs 


of disintegration in children are discussed and 
related to the Rorschach. 


Menano, Cecilia. Importance du tra- 
vail colletif dans le dessin et dans la 
peinture libres. A Crianca Portuguesa, 
1958, 17, 809-810. 

A brief report on the use of children’s group 
participation in creating designs and paint- 
ings. 

dos Santos, J. & Menano, Cecilia. 
L’acquisition de la forme dans les des- 
sin et dans la peinture enfantines. A 
Crianca Portuguesa, 1958, 17, 811-813. 

A description of stages of development in 
children’s free drawing and painting. 


Diaz Arnal, Isabel. Valeur diagnos- 
tique du dessin des enfants deficients. 
A Crainca Portuguesa, 1958, 17, 937- 
938. 


Diagnostic clues in children’s drawings. 


Meili, R. Das Problem der Validation 
der Methoden der Personlichkeitsun- 
tersuchingen. Zeitschr. f. diagnost. Psy- 
chol., 1958, 4, 283-303. 

A theoretical discussion of validity studies 
and theor implications. 


Lucena, J., Borges, J. C. C., Sette, P. 
E. & Souto, Maria do C. Le Bender Ges- 
talt Test et quelques donnees d’analy- 
ses structurales chez des schizophrenes. 
Zeitschr. f. diagnost. Psychol., 1958, 4, 
313-316. 

Significant differences were found on the 
Bender between acute and chronic schizo- 
phrenics. 











ANNOUNCEMENTS 


TWENTIETH ANNUAL PROGRAM 
of the 
Society for Projective Techniques 
CINCINNATI, OHIO 
FRIDAY, SEPTEMBER 4, 1959 


9:00 - 10:50 A.M. 

Symposium: The Role of Stimulus Structure 
in Projective Techniques. Dr. Edwin S. 
Shneidman, Central Research Unit, VA Cen- 
ter, Los Angeles, Chairman. 

Participants: Dr. Kenneth B. Little, Nation- 
al Institutes of Mental Health; Dr. Bernard I. 
Murstein, Univ. of Portland; Dr. Edith Weiss- 
kopf-Joelson, Purdue Univ.; Dr. Ira Friedman, 
Cleveland Psychiatric Institute. 

Discussants: Dr. Jerome Kagan, Fels Re- 
search Institute; Dr. Joseph Zubin, N.Y. State 
Dept. of Mental Hygiene. 


SATURDAY, SEPTEMBER 5, 1959 
11:00 - 12:50 A.M. 

Symposium: The Prediction of Overt Be- 
havior Through the Use of Projective Tech- 
niques. Dr. Arthur C. Carr, New York Psychi- 
atric Institute, Chairman. 

Participants: Dr. William E. Henry, Univer- 
sity of Chicago: ““The TAT”; Dr. Bertram R. 
Forer, Los Angeles VA Mental Hygiene Clinic: 
“Sentence Completion Tests”; Dr. Max Hutt, 
University of Michigan: “Graphomotor Tech- 
niques (Bender-Gestalt Test & Draw-a-Person 
Test)”; Dr. Zygmunt A. Piotrowski, Jefferson 
Medical College: “Rorschach Test”. 

Discussant: Dr. Evelyn Hooker, U.C.L.A. 

3:00 - 4:00 P.M. 

Business Meeting, Society for Projective 
Techniques. 

4:00 - 5:00 P.M. 

Presidential Address, Society for Projective 
Techniques. 

Dr. William E. Henry: Personality Develop- 
ment in Adulthood and Old Age. 


5:00 - 6:30 P.M. 
Dutch Treat Cocktail Party, Society for Pro- 
jective Techniques. 


APA CONVENTION 


INTERNATIONAL PROGRAMS 
One of the highlights of the 1959 APA Con- 
vention in Cincinnati will be a Round Table 
on “Reciprocal Influences in International 
Psychology,” scheduled for Saturday after- 
noon, September 5, from 2-4 p.m., in the Ball- 
room of the Hotel Sinton. Jointly sponsored by 


APA Divisions 1, 9, and 12, the Round Table 
will also have representation from the Inter- 
american Society of Psychology and the Society 
for Projective Techniques. Participants and 
their areas of discussion include Drs. Gordon 
W. Allport (Africa), Eugene L. Geier (Scan- 
dinavia) , Roger Myers (Canada), Nevitt San- 
ford (Eastern Europe), and Emanuel Schwartz 
(Latin America) , plus two visiting psycholo- 
gists from abroad (still to be announced) . Dr. 
Henry P. David is Chairman. Members and 
guests planning to attend the APA Conven- 
tion will also be encouraged to participate 
from the floor. 

Immediately following the Round Table, 
APA will again sponsor an Open Coffee Hour 
for Visitors From Other Lands in the Crystal 
Room of the Hotel Sinton. As before, the In- 
ternational Council of Women Psychologists 
has been asked to arrange hostess service. 


MEMBERSHIP DECISIONS 


The Society for Projective Techniques is 
pleased to announce the following member- 
ship decisions. 


Fellows: 
Farberow, Norman L. Roe, Anne 
Garfield, S. L. Rosenzweig, Saul 


Sanford, R. Nevitt 
Shapiro, David 
Singer, Erwin 
Snowden, Robert F. 
Stein, Morris I. 
Tomkins, Silvan S. 
Treat, Wolcott C. 


Halpern, Florence 
Holtzman, Wayne H. 
Horn, Daniel 
Kalinkowitz, B. N. 
Lazovik, David A. 
Lindzey, Gardner 
Little, Kenneth B. 
Molish, Herman B. 


Associates: 


Akavia, Uriel Jortner, Sidney 
Berliner, Hildegard Klein, Louis S. 
David, Charlotte Liutkus, Stankey 
Ehrenreich, Gerald A. Reens, Renee 
Fabrikant, Benjamin Schon, Martha 
Greenberg, Nathan Scott, Edward 
Haworth, Mary R. Tada, Harou 
Hoshino, Akira Tanaka, Fuzio 
Isaacs, Mark Zeev, Bracha 


Student Affiliates: 


Donoghue, John 
Kleban, Morton 


Schoenborn, Elizabeth 


BOOKS FOR REVIEW 


The Executive Editor invites offers to re 
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view the following books. 

Bass, B. M. & Berg, I. A. (Eds.) Objective 
approaches to personality assessment. 

Bellak, L., Ostow, M., Pumpian-Mindlin, E., 
Stanton, A. H., & Szasz, T. S. Conceptual and 
methodological problems in psychoanalysis. 

Caligor, L. A new approach to figure draw- 
ing. (With test materials for the Eight Card 
Redrawing Test.) 

Delay, J., Pichot, P., Lemperiere, T. & Perse, 
J. The Rorschach and the epileptic person- 
ality. 

Harrower, Molly. Personality change and 
development (As measured by the projective 
techniques.) 

Orr, Miriam. Le test de Rorschach et 
Vimago maternelle. 

Szondi, L., Moser, U. & Webb, M. W. The 
Szondi Test in diagnosis, prognosis and treat- 
ment. 


377 
ERRATA 


Three errors exist in the paper by Arthur 
B. Silverstein. Identification with same-sex and 
opposite-sex figures in thematic apperception. 
J. proj. Tech., 1959, 23, 73-75. 

All errors are in Table 1. The value of t¢ for 
Male Succorance should be 1.59, not 1.61. The 
value of r, Opposite-Sex for Female Domi- 
nance should be +-.32, not —.32. The value of 
t for Female Dominance (2.61) is significant at 
the .05 level, and should be starred. 


If anyone wishes to dispose of back issues of 
the Journal, foreign psychologists who lack the 
funds to purchase them would appreciate re- 
ceiving them. Specifically, volumes 19 through 
21 are missing from the library of Mieczyslaw 
Choynowski, Pracownia Psychologiczna, Szpi- 
tal w Kobierzynie, Krakow 18, Poland. 


Research Exchange 
A Comment on the Wiener-Nichols Controversy (2: 3,4, 5) 


RoserT R. Hott 
Research Center for Mental Health, New York University 


To what a sad pass we have come when we 
must invoke their putative morale-building 
value as justification for projective techniques! 
Of course it is true that a professional person’s 
habitual activities, and the accustomed heft of 
the tools of his trade, give him the reassurance 
of familiarity and renew his sense of profes- 
sional identity and continuity with the past. 
But these are essentially non-rational consider- 
ations, and ones that can hardly play any 
major role in important decisions, such as 
whether projective techniques should be aban- 
doned, should be subjected to intensified re- 
search, or the like. Doubtless the imposing 
beard of the Victorian physician was a great 
comfort to him, but that was not enough to 
preserve it when it was suspected of being a 
germ-carrier. Likewise, we have to find much 
more substantial merits in projective tech- 
niques than the non-rational ones cited by 
Weisskopf-Joelson and Nichols, to justify our 
keeping them. 

This little controversy, which started out 
with a reasonable proposal by Wiener that we 
ought to work harder on getting people to 
agree on interpretations, seems to have been 
overexpanded by Nichols’ rather extreme 
views. A column like this is hardly the place to 
debate fundamental issues about projective 
techniques or to make flat assertions like the 
two in Nichols’ contribution. 


Let’s stick to the issue, because Wiener’s 
point was a good one but was nearly lost in 
the hue and cry. Actually, we need to start 
further back than diagnoses, predictions or 
other integrative statements; there are entirely 
too few studies on the reliability of basic scor- 
ing. Having been involved in an attempt to 
devise scoring schemes that would guide clini- 
cal judgment in making predictions (1), I’m 
very much aware of how hard it is to get agree- 
ment on scores that require any degree of 
clinical judgement. Whether to persist 
through the laborious stages of developing and 
revising procedures for dealing with projective 
techniques or turn rather to more and more 
ingenious, objectively scorable tests is a deci- 
sion that each of us has to make for himself in 
relation to a particular problem. For any spe- 
cific research task, like predicting suitability of 
persons for an occupational role, it may be 
that the long-run usefulness and values of 
what can be developed will repay the effort; 
for routine diagnostic use, the situation is 
quite different. It is practically impossible to 
find objective instruments that will do much 
for us there, since we're not predicting a finite 
number of clearly specifiable criteria. 

One of the ironic paradoxes in this situation 
is that the really research-minded people who 
might be willing to take up issues such as the 
one Wiener proposes are generally not much 
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interested in devoting their efforts to homely 
goals like improving the usefulness of clinical 
techniques. And, on the other hand, the good 
clinicians who are deeply interested in their 
instruments and how well they work often 
don’t feel comfortable enough with research 
methods to do the job, either. Yet sooner or 
later we have to supplement the oral tradition 
of working with projective tests by research- 
derived data and principles. 

If I may contribute a personal reflection to 
this discussion, I’d like to veer off in another 
direction and defend the proposition that pro- 
jective techniques and other diagnostic tests 
are, and most usefully ought to remain, com- 
parable to the interview and the autobiogra- 
phy as sources of clinical data. We hardly need 
better instruments for gathering data; rather, 
we need most of all a better conceptual frame- 
work in terms of which to conceptualize the 
richness of these data. Thus, even though I 
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would favor studies of the kind Wiener advo- 
cates, I predict that in the long-run we won’t 
make very much progress with projective tech- 
niques until we have some major advances in 
the theory of personality. 
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